Popular repositories Loading
-
-
SWE-bench_Pro-os
SWE-bench_Pro-os PublicSWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?
-
-
-
data-prefetch-link
data-prefetch-link PublicExtends next.js <Link> to allow invoking getInitialProps when prefetching a page
Repositories
- DrugDiscoveryBench Public
Opensource repository containing task and image data for DrugDiscoveryBench
scaleapi/DrugDiscoveryBench’s past year of commit activity - SWE-Interact Public
New testbed of interactive SWE tasks for coding agents, set in a realistic multi-turn developer driven environment
scaleapi/SWE-Interact’s past year of commit activity - nucleus-python-client Public
The official Python SDK for Nucleus, part of Scale API, the Data Platform for AI
scaleapi/nucleus-python-client’s past year of commit activity - terminal-bench-3-public Public Forked from harbor-framework/terminal-bench-3
🚧 Accepting Task Submissions 🚧
scaleapi/terminal-bench-3-public’s past year of commit activity - vero Public
VeRO is an evaluation harness for using coding agents to optimize LLM-based agents and workflows. It treats agent code as a versioned artifact — making changes, evaluating results, and hill-climbing toward better performance using git version control.
scaleapi/vero’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…