Llm

Your weight: normal

all topics
  1. 0.
    0 points 1 sources 1 minutes ago cluster

    A user attempted to fine-tune a large language model (LLM) to write technical documentation in the style of 1990s software technical writers, using a personal, local model and a large corpus of written sources.

  2. 0.
    0 points 1 sources 1 minutes ago cluster

    Researchers at Hugging Face developed task-seeded synthetic Q&A generation for Nemotron pretraining, which adds structured learning signals to large-scale LLM development. This approach improved performance in various tasks, including code and commonsense understanding.

  3. 0.
    0 points 1 sources 1 minutes ago cluster

    Mnemo is a local-first AI memory layer for any Large Language Model (LLM), developed in Rust and utilizing SQLite and petgraph. It allows for persistent knowledge graph, entity extraction, and semantic retrieval.

  4. 0.
    0 points 1 sources 1 minutes ago cluster

    Researchers from Linyao Chen et al. propose a novel approach to mobility prediction using Large Language Model (LLM)-driven agents, aiming for efficiency and evidence-grounded results.

  5. 0.
    0 points 1 sources 1 minutes ago cluster

    Researchers propose Agentic Chain-of-Thought Steering, a method to improve the efficiency and controllability of large language model (LLM) reasoning. This approach uses a chain-of-thought mechanism to guide LLMs towards more efficient and accurate reasoning.

  6. 0.
    0 points 1 sources 1 minutes ago cluster

    Researchers propose a method to reduce bias in multimodal large language models (LLMs) by introducing perceptual perturbation and reward modeling. The approach aims to improve the fairness and accuracy of LLMs in judgment tasks.

  7. 0.
    How to Build a Shitty Robot (mariozechner.at)
    0 points 1 sources 1 minutes ago cluster

    A DIY project to transform a low-cost toy robot into a fun LLM-powered toy for kids, using readily available materials and simple soldering.

  8. 0.
    0 points 1 sources 1 minutes ago cluster

    AWS has introduced comprehensive observability for Amazon SageMaker AI LLM inference, providing insights into GPU utilization and LLM quality. This feature enables users to monitor and optimize their AI models for better performance and efficiency.

  9. 0.
    0 points 1 sources 1 minutes ago cluster

    jmaczan has open-sourced Tiny-vLLM, a high-performance LLM inference engine built in C++ and CUDA, making it a smaller version of vLLM. The project is available on GitHub, with 141 stars and 7 forks.

  10. 0.
    0 points 1 sources 1 minutes ago cluster

    Researchers demonstrate that AI inference on standard GPUs can reach speeds of 3,000 tokens per second, rivaling dedicated inference hardware, by optimizing the software stack through architecture/engine/kernel co-design.

  11. 0.
    llm-anthropic 0.25.1 (simonwillison.net)
    0 points 1 sources 1 minutes ago cluster

    Anthropic has released version 0.25.1 of its LLM access platform, including the Claude series. The update introduces a new model, Claude Opus 4.8, and a fast mode option for organizations with enabled accounts.

  12. 0.
    0 points 1 sources 1 minutes ago cluster

    Researchers propose a method to bound compositional incoherence in multi-component large language model (LLM) agents, showing that local coherence can be achieved without sacrificing global performance.

  13. 0.
    0 points 1 sources 1 minutes ago cluster

    Researchers from Yalun Dai et al. submitted a paper to arXiv AI on May 28, 2026, exploring data organization techniques for improved Large Language Model (LLM) training.

  14. 0.
    0 points 1 sources 1 minutes ago cluster

    A short paper titled 'Mind Your Tone: Investigating How Prompt Politeness Affects LLM Accuracy' was submitted to arXiv on October 6, 2025, by Om Dobariya and Akhil Kumar.

  15. 0.
    0 points 1 sources 3 hours ago cluster

    Norway acquired 2 petabytes of Huawei flash storage for large language model (LLM) training, upgrading its AI research capabilities.