
[2408.01800] MiniCPM-V: A GPT-4V Level MLLM on Your Phone
Aug 3, 2024 · More importantly, MiniCPM-V can be viewed as a representative example of a promising trend: The model sizes for achieving usable (e.g., GPT-4V) level performance are rapidly decreasing, …
openbmb/MiniCPM-V-2_6 · Hugging Face
Oct 14, 2025 · It achieves state-of-the-art performance on OCRBench, surpassing proprietary models such as GPT-4o, GPT-4V, and Gemini 1.5 Pro. Based on the the latest RLAIF-V and VisCPM …
GitHub - nuoan/MiniCPM-V2.6: MiniCPM-V 2.6: A GPT-4V Level …
It outperforms GPT-4o mini, Gemini 1.5 Pro and Claude 3.5 Sonnet in single image understanding, and advances MiniCPM-Llama3-V 2.5's features such as strong OCR capability, trustworthy behavior, …
ModelBest Releases MiniCPM-V 2.6, Matching GPT-4V in Edge Performance
Aug 7, 2024 · The MiniCPM-V 2.6, with 8 billion parameters, not only catches up to GPT-4V in overall performance but also marks the first time an edge model has completely surpassed GPT-4V in three …
Comparison with Other Models | OpenBMB/MiniCPM-o | DeepWiki
Apr 18, 2025 · This page provides a detailed comparison between MiniCPM models and other leading multimodal large language models, covering performance metrics, efficiency, architecture, and …
MiniCPM-V 2.6: A GPT-4V Level Multimodal LLMs for Single Image, …
Aug 7, 2024 · This model introduces significant enhancements in performance and new features tailored for multi-image and video understanding, achieving substantial advancements over its predecessor, …
minicpm-v - ollama.com
It achieves state-of-the-art performance on OCRBench, surpassing proprietary models such as GPT-4o, GPT-4V, and Gemini 1.5 Pro. Based on the the latest RLAIF-V and VisCPM techniques, it features …
GitHub - OpenBMB/MiniCPM-V: MiniCPM-V 4.5: A GPT-4o Level …
With a total of 8B parameters, this model outperforms GPT-4o-latest, Gemini-2.0 Pro, and Qwen2.5-VL 72B in vision-language capabilities, making it the most performant on-device multimodal model in the …
Hugging Face
MiniCPM-V 2.6 can process images with any aspect ratio and up to 1.8 million pixels (e.g., 1344x1344). It achieves **state-of-the-art performance on OCRBench, surpassing proprietary models such as …
Wall-Facing Intelligent Open Source MiniCPM-V 2.6 Edge AI …
Aug 7, 2024 · The introduction of the MiniCPM-V2.6 model is of significant importance for the development of edge AI. It not only enhances multimodal processing capabilities but also showcases …