Results for "LLM generation"
11 results found

DeepSeek Open-Sources Inference Optimizations With 60-85% Speed Gains
DeepSeek released open-source inference optimizations that accelerate LLM generation by 60-85%. The move aims to democratize fast AI inference.

Lowfat CLI Tool Cuts LLM Token Usage by 91.8%
A new open-source CLI filter called Lowfat claims to reduce LLM token consumption by over 91%, offering developers significant cost savings on AI API calls.

Code Style Impacts LLM Token Costs, Analysis Reveals
A developer's code formatting choices directly affect token consumption and API costs when using large language models. The findings highlight new optimization strategies.

Antigravity 2.0 Dominates First OpenSCAD 3D LLM Benchmark
Antigravity 2.0 tops the OpenSCAD Architectural 3D LLM Benchmark, demonstrating superior ability to generate valid 3D models from natural language prompts.

The New Complexity of Large Language Models
Large language models are growing more complex with new architectures and techniques. This shift has implications for performance, interpretability, and the future of AI research.

New Technique Losslessly Compresses KV Cache Up to 4x for Faster AI Inference
Speculative KV coding compresses key-value cache up to 4x without loss, potentially cutting memory costs and enabling larger models on existing hardware.

Why Some Experts Compare AI Chatbots to Religious Belief Systems
A growing number of researchers argue people treat large language models with faith-like trust, raising concerns about blind reliance on AI.

Multi-Agent LLM System Automates Vulnerability Discovery and Reproduction
Researchers built a multi-agent LLM system that autonomously finds and reproduces software vulnerabilities, promising faster security testing.

Study Finds Politeness in AI Prompts Can Impact Model Accuracy
Research reveals that prompt tone significantly influences LLM accuracy. Polite prompts may boost performance while impolite ones degrade it.

Why AI Evaluation Startups Struggle to Survive
Eval startups face commoditization, open-source pressure and weak business models. A look at why so many fail to scale.

LLMs Do Math Without Numbers: New Research Reveals Hidden Process
New analysis shows large language models solve arithmetic using pattern matching and embeddings, not explicit numbers. The findings challenge assumptions about AI reasoning.