Nomic Embed Text v1 is an open‑source, fully auditable text embedding model
1m
10K+
4
397B-parameter MoE multimodal LLM with 17B active params, 262K context, 201 languages
3m
10K+
1
Qwen3 is the latest Qwen LLM, built for top-tier coding, math, reasoning, and language tasks.
8m
10K+
1
Granite Docling is a multimodal model for efficient document conversion.
9m
10K+
2
Embedding Gemma is a state-of-the-art text embedding model from Google DeepMind
10m
10K+
3
Devstral Small 2 is an FP8 instruct LLM for agentic SWE tasks, codebase tooling, and SWE-bench.
6m
10K+
4
Safety reasoning models for policy-based text classification and foundational safety tasks.
8m
10K+
2
GLM-4.7-Flash is a top 30B-A3B MoE, balancing strong performance with efficient deployment.
5m
10K+
3
Qwen3 Embedding: multilingual models for advanced text/ranking tasks like retrieval & clustering.
8m
10K+
2
Multilingual reranking model for text retrieval, scoring document relevance across 119 languages.
7m
10K+
3
OpenAI’s open-weight models designed for powerful reasoning and agentic tasks
8m
10K+
2
Multilingual reranking model for text retrieval, scoring document relevance across 119 languages.
7m
10K+
744B MoE language model with 40B active params for reasoning, coding, and agentic tasks (FP8)
4m
10K+
3
SmolVLM: lightweight multimodal model for video, image, and text analysis, optimized for devices
8m
10K+
2
Multimodal AI model with 35B MoE architecture for coding agents, reasoning, and vision tasks
2m
10K+
Designed for reasoning, agentic, and general-purpose use, with versatile developer-friendly features
10m
10K+
2
24B multimodal instruction model by Mistral AI, tuned for accuracy, tool use, and less repetition
9m
10K+
1
SmolVLM: lightweight multimodal model for video, image, and text analysis, optimized for devices.
9m
10K+
4
mxbai-embed-large-v1 is a top English embed model by Mixedbread AI, great for RAG and more.
1y
10K+
3
Granite-4.0-nano: lightweight instruct model trained via SFT, RL, and merging on diverse data.
8m
10K+
IBM's Granite 3.0 large language model (LLM), optimized for running locally
1y
10K+
1
Agentic coding LLM (24B) fine-tuned from Mistral-Small-3.1 with a 128K context window
9m
10K+
4
FunctionGemma is a 270M open model for fine-tuned, offline function-calling agents on small devices.
6m
10K+
2
7B long-context instruct model with RL alignment, instruction following (IF), tool use, and enterprise optimization.
9m
8.4K
4
An open-source visual language model that interprets images via text prompts, fast and powerful.
9m
7.8K
2
Granite Embedding Multilingual is a 278-million-parameter, encoder‑only XLM‑RoBERTa‑style model
11m
7.1K
4
FunctionGemma is a 270M open model for fine-tuned, offline function-calling agents on small devices.
6m
7.1K
1