Pattern Recognition

Your weight: normal

all topics
  1. 0.
    0 points 1 sources 1 minutes ago cluster

    Researchers propose GeM-NR, a method for editing nonrigid scene changes in multi-view scenarios, leveraging geometry-aware techniques.

  2. 0.
    0 points 1 sources 1 minutes ago cluster

    Researchers propose a model that learns from a child's egocentric input, combining visual and verbal data to improve learning capabilities. The model is designed to mimic a child's learning process, allowing for continual learning and adaptation.

  3. 0.
    0 points 1 sources 1 minutes ago cluster

    Researchers Lianghuan Huang et al. submitted a paper to arXiv, formalizing the binding problem in computer vision and pattern recognition. The paper, titled Formalizing the Binding Problem, explores the concept and its implications.

  4. 0.
    0 points 1 sources 1 minutes ago cluster

    Researchers propose a monotonic adaptive norm rescaling approach for long-tailed recognition, aiming to improve hyperparameter-friendliness in optimization.

  5. 0.
    0 points 1 sources 1 minutes ago cluster

    Researchers Kaidi Zhang and Guanxu Zhu proposed a novel view synthesis method using differentiable multiplane images, achieving fast and lightweight results.

  6. 0.
    0 points 1 sources 1 minutes ago cluster

    Researchers propose TunerDiT, a training-free progressive steering method for multi-event video generation using diffusion transformers. This approach enables efficient video generation without requiring extensive training data.

  7. 0.
    0 points 1 sources 1 minutes ago cluster

    Researchers introduce VideoMLA, a low-rank latent KV cache for minute-scale autoregressive video diffusion. This approach aims to improve video generation efficiency.

  8. 0.
    0 points 1 sources 1 minutes ago cluster

    Researchers propose LocateAnything, a vision-language grounding model that uses parallel box decoding for fast and high-quality results, outperforming existing methods in various tasks.

  9. 0.
    0 points 1 sources 5 hours ago cluster

    Researchers proposed a method to improve the capacity of multimodal large language models for subject-driven generation, used in text-to-image synthesis applications.

  10. 0.
    0 points 1 sources 5 hours ago cluster

    Researchers proposed a novel method, Channel-wise Vector Quantization, for efficient image processing based on vector quantization, aiming to reduce computational complexity in computer vision tasks.