← Back to Vision Models LFM2-VL-3B is Liquid AI’s highest-capacity multimodal model, delivering enhanced visual reasoning and detailed image understanding. Ideal for complex vision tasks requiring deeper comprehension.Documentation Index
Fetch the complete documentation index at: https://liquidai-link-snapshot-contract.mintlify.app/llms.txt
Use this file to discover all available pages before exploring further.
Specifications
| Property | Value |
|---|---|
| Parameters | 3B |
| Context Length | 32K tokens |
| Architecture | LFM2-VL (Dense) |
Advanced Reasoning
Complex visual logic and analysis
Document Understanding
Detailed document and chart parsing
Multi-Image
Compare and reason across images
Quick Start
- Transformers
- vLLM
- SGLang
- llama.cpp