Llm

Your weight: normal

0.

Fine-tuning an LLM to write docs like it's 1995 (passo.uno)

0 points 1 sources 1 minutes ago cluster

A user attempted to fine-tune a large language model (LLM) to write technical documentation in the style of 1990s software technical writers, using a personal, local model and a large corpus of written sources.

ai llm natural-language-processing tech-writing
0.

Task-Seeded Synthetic Q&A Generation for Nemotron Pretraining (huggingface.co)

0 points 1 sources 1 minutes ago cluster

Researchers at Hugging Face developed task-seeded synthetic Q&A generation for Nemotron pretraining, which adds structured learning signals to large-scale LLM development. This approach improved performance in various tasks, including code and commonsense understanding.

ai llm startups
0.

Show HN: Mnemo – local-first AI memory layer for any LLM (Rust, SQLite, petgraph) (github.com)

0 points 1 sources 1 minutes ago cluster

Mnemo is a local-first AI memory layer for any Large Language Model (LLM), developed in Rust and utilizing SQLite and petgraph. It allows for persistent knowledge graph, entity extraction, and semantic retrieval.

ai llm petgraph rust sqlite
0.

Towards Efficient and Evidence-grounded Mobility Prediction with LLM-Driven Agent (arxiv.org)

0 points 1 sources 1 minutes ago cluster

Researchers from Linyao Chen et al. propose a novel approach to mobility prediction using Large Language Model (LLM)-driven agents, aiming for efficiency and evidence-grounded results.

agents llm mobility-prediction
0.

Agentic Chain-of-Thought Steering for Efficient and Controllable LLM Reasoning (arxiv.org)

0 points 1 sources 1 minutes ago cluster

Researchers propose Agentic Chain-of-Thought Steering, a method to improve the efficiency and controllability of large language model (LLM) reasoning. This approach uses a chain-of-thought mechanism to guide LLMs towards more efficient and accurate reasoning.

agents artificial-intelligence llm
0.

Mitigating Perceptual Judgment Bias in Multimodal LLM-as-a-Judge via Perceptual Perturbation and Reward Modeling (arxiv.org)

0 points 1 sources 1 minutes ago cluster

Researchers propose a method to reduce bias in multimodal large language models (LLMs) by introducing perceptual perturbation and reward modeling. The approach aims to improve the fairness and accuracy of LLMs in judgment tasks.

bias llm models multimodal perceptual-perturbation
0.

How to Build a Shitty Robot (mariozechner.at)

0 points 1 sources 1 minutes ago cluster

A DIY project to transform a low-cost toy robot into a fun LLM-powered toy for kids, using readily available materials and simple soldering.

ai diy llm robotics stem
0.

Comprehensive observability for Amazon SageMaker AI LLM inference: From GPU utilization to LLM quality (aws.amazon.com)

0 points 1 sources 1 minutes ago cluster

AWS has introduced comprehensive observability for Amazon SageMaker AI LLM inference, providing insights into GPU utilization and LLM quality. This feature enables users to monitor and optimize their AI models for better performance and efficiency.

ai llm machine-learning sagemaker
0.

Show HN: Tiny-vLLM – high performance LLM inference engine in C++ and CUDA (github.com)

0 points 1 sources 1 minutes ago cluster

jmaczan has open-sourced Tiny-vLLM, a high-performance LLM inference engine built in C++ and CUDA, making it a smaller version of vLLM. The project is available on GitHub, with 141 stars and 7 forks.

ai c cuda llm machine-learning
0.

Real-time LLM Inference on Standard GPUs: 3k tokens/s per request (blog.kog.ai)

0 points 1 sources 1 minutes ago cluster

Researchers demonstrate that AI inference on standard GPUs can reach speeds of 3,000 tokens per second, rivaling dedicated inference hardware, by optimizing the software stack through architecture/engine/kernel co-design.

ai chips gpu inference llm
0.

llm-anthropic 0.25.1 (simonwillison.net)

0 points 1 sources 1 minutes ago cluster

Anthropic has released version 0.25.1 of its LLM access platform, including the Claude series. The update introduces a new model, Claude Opus 4.8, and a fast mode option for organizations with enabled accounts.

anthropic llm models
0.

Locally Coherent, Globally Incoherent: Bounding Compositional Incoherence in Multi-Component LLM Agents (arxiv.org)

0 points 1 sources 1 minutes ago cluster

Researchers propose a method to bound compositional incoherence in multi-component large language model (LLM) agents, showing that local coherence can be achieved without sacrificing global performance.

agents artificial-intelligence llm
0.

Demystifying Data Organization for Enhanced LLM Training (arxiv.org)

0 points 1 sources 1 minutes ago cluster

Researchers from Yalun Dai et al. submitted a paper to arXiv AI on May 28, 2026, exploring data organization techniques for improved Large Language Model (LLM) training.

ai data-organization llm
0.

Prompt Politeness Affects LLM Accuracy (arxiv.org)

0 points 1 sources 1 minutes ago cluster

A short paper titled 'Mind Your Tone: Investigating How Prompt Politeness Affects LLM Accuracy' was submitted to arXiv on October 6, 2025, by Om Dobariya and Akhil Kumar.

computer-science llm prompt-politeness
0.

Norway's 2 petabytes of Huawei flash storage and LLM training (blocksandfiles.com)

0 points 1 sources 3 hours ago cluster

Norway acquired 2 petabytes of Huawei flash storage for large language model (LLM) training, upgrading its AI research capabilities.

ai huawei llm norway