A recent study introduces a benchmark for evaluating how well artificial intelligence can complete full remote work projects rather than isolated tasks. Using real freelance assignments in fields such as design, data analysis, video production, and software development, the researchers assessed whether AI systems could deliver end-to-end results at a professional standard.
The findings show that even advanced AI models struggled significantly. Although the systems performed well on individual components, they did not consistently produce complete, polished projects. Common problems included deliverables in the wrong format, incomplete outputs, and poor coordination across multiple steps. Only a very small percentage of projects were completed at a level comparable to that of human professionals.
The study concludes that AI is not yet capable of replacing human remote workers for complex projects. Instead, its current strength lies in supporting specific tasks and boosting productivity, reinforcing the idea that human oversight and judgment remain essential in professional work.