About 55,500,000 results
Open links in new tab
  1. [2408.01800] MiniCPM-V: A GPT-4V Level MLLM on Your Phone

    Aug 3, 2024 · More importantly, MiniCPM-V can be viewed as a representative example of a promising trend: The model sizes for achieving usable (e.g., GPT-4V) level performance are rapidly decreasing, …

  2. openbmb/MiniCPM-V-2_6 · Hugging Face

    Oct 14, 2025 · It achieves state-of-the-art performance on OCRBench, surpassing proprietary models such as GPT-4o, GPT-4V, and Gemini 1.5 Pro. Based on the the latest RLAIF-V and VisCPM …

  3. GitHub - nuoan/MiniCPM-V2.6: MiniCPM-V 2.6: A GPT-4V Level …

    It outperforms GPT-4o mini, Gemini 1.5 Pro and Claude 3.5 Sonnet in single image understanding, and advances MiniCPM-Llama3-V 2.5's features such as strong OCR capability, trustworthy behavior, …

  4. ModelBest Releases MiniCPM-V 2.6, Matching GPT-4V in Edge Performance

    Aug 7, 2024 · The MiniCPM-V 2.6, with 8 billion parameters, not only catches up to GPT-4V in overall performance but also marks the first time an edge model has completely surpassed GPT-4V in three …

  5. Comparison with Other Models | OpenBMB/MiniCPM-o | DeepWiki

    Apr 18, 2025 · This page provides a detailed comparison between MiniCPM models and other leading multimodal large language models, covering performance metrics, efficiency, architecture, and …

  6. MiniCPM-V 2.6: A GPT-4V Level Multimodal LLMs for Single Image, …

    Aug 7, 2024 · This model introduces significant enhancements in performance and new features tailored for multi-image and video understanding, achieving substantial advancements over its predecessor, …

  7. minicpm-v - ollama.com

    It achieves state-of-the-art performance on OCRBench, surpassing proprietary models such as GPT-4o, GPT-4V, and Gemini 1.5 Pro. Based on the the latest RLAIF-V and VisCPM techniques, it features …

  8. GitHub - OpenBMB/MiniCPM-V: MiniCPM-V 4.5: A GPT-4o Level …

    With a total of 8B parameters, this model outperforms GPT-4o-latest, Gemini-2.0 Pro, and Qwen2.5-VL 72B in vision-language capabilities, making it the most performant on-device multimodal model in the …

    Missing:
    • performance comparison
  9. Hugging Face

    MiniCPM-V 2.6 can process images with any aspect ratio and up to 1.8 million pixels (e.g., 1344x1344). It achieves **state-of-the-art performance on OCRBench, surpassing proprietary models such as …

    Missing:
    • performance comparison
  10. Wall-Facing Intelligent Open Source MiniCPM-V 2.6 Edge AI …

    Aug 7, 2024 · The introduction of the MiniCPM-V2.6 model is of significant importance for the development of edge AI. It not only enhances multimodal processing capabilities but also showcases …

    Missing:
    • performance comparison