by OpenGVLab
The OpenGVLab InternVL2-8B is a cutting-edge multimodal large language model that combines the InternViT-300M-448px vision encoder, an MLP projector, and the internlm2_5-7b-chat language model, enabling it to excel at tasks like image description, visual reasoning, and document analysis. Optimized for both efficiency and performance, it supports instruction-tuned multimodal interactions, achieves competitive results on benchmarks like MathVista, and can be deployed via frameworks like Hugging Face Transformers or LMDeploy, with options for 4-bit quantization to speed up inference.
Complete information about the vendor/provider of this AI application
1 considerations identified
Review recommended before use
These considerations are automatically identified based on publicly available information about the vendor and AI catalog data. Actual risks may vary based on your specific use case and implementation.
Get insights into risk by running assessments on this AI application.
Discover EU-based alternatives for this AI application.
Track, assess, and govern your AI applications with Anove.