blogposts

Blog Posts

Measuring Context Utilization on Recent Open-Source Long Context LLMs

This blog post aims to evaluate how well the most capable open-source long context large language models (LLMs) utilize context, using the Needle In A Haystack test. We focus on the task of chapter summarization for recently published books to minimize data contamination while ensuring a challenging test. Our discussion highlights the results of the test conducted on the Llama 3.1 70B Nemotron Instruct model, revealing performance variations across different context lengths and needle placement depths.

19 min read · April 28, 2025

2025
Sample Blog Post

Your blog post's abstract. Please add your abstract or summary here and not in the main body of your text. Do not include math/latex or hyperlinks.

15 min read · April 28, 2025

2025
Sample Blog Post (HTML version)

Your blog post's abstract. Please add your abstract or summary here and not in the main body of your text. Do not include math/latex or hyperlinks.

12 min read · April 28, 2025

2025