API Performance Benchmark

Elastic Introduces Best-in-Class Embedding Models for High Performance Semantic Search

Elastic (NYSE: ESTC), the Search AI Company, today announced the availability of jina-embeddings-v5-text, a family of two small, Elasticsearch-native multilingual embedding models at 0.2B and 0.6B ...

Unite.AI

Easy Rewording Breaks AI Safety, Even for Gemini and Claude

AI safety tests found to rely on 'obvious' trigger words; with easy rephrasing, models labeled 'reasonably safe' suddenly fail, with attacks succeeding up to 98% of the time. New corporate research ...

Stark Insider

7 Ways to Stop Bleeding Money on AI API Calls

AI API calls are expensive. After our always-on bot burned through tokens, we found seven optimization levers that cut costs ...

Google Gemini 3.1 Pro Nearly Doubles Apex Agents Score to 33.5

Office Productivity: The Apex Agents benchmark, which evaluates productivity in office-like environments, saw Gemini 3.1 Pro ...

Backboard.io Becomes First AI Platform to Lead Both Major Memory Benchmarks, Accelerating the Era of Agentic AI

Backboard.io announced it has achieved state-of-the-art performance across both leading AI memory benchmarks, a first ...

Google’s Latest Gemini 3.1 Pro Model Is a Benchmark Beast

Google just released its most capable Gemini 3.1 Pro AI model that beats all frontier models on Humanity's Last Exam and ...

Aquant's 2026 Field Service Benchmark: Companies Can Unlock up to 26% in Service Cost Savings by Scaling Knowledge Across the Workforce

Aquant today released The 2026 Field Service KPI Benchmark Report, an industry-wide analysis of anonymized performance data from 161 service organizations. The report spans nearly 30 million service ...

Speechify's AI Voice Research Lab Launches SIMBA 3.0 Voice Model to Power Next Generation of Voice AI

Google launches Gemini 3.1 Pro, retaking AI crown with 2X+ reasoning performance boost

Nvidia pulls ahead as AMD's software stack falls short: report

OpenAI introduces EVMbench to measure AI crypto security