gloq/models

Values

pub const deepseek_r1_distill_llama_70b: String

DeepSeek R1 distilled into Llama 70B — strong reasoning model.

pub const gemma2_9b_it: String

Gemma 2 9B — Google’s efficient instruction-tuned model.

pub const gemma_7b_it: String

Gemma 7B — legacy Google model. Prefer gemma2_9b_it.

pub const llama3_70b_8192: String

Llama 3 70B — legacy model, 8k context. Prefer llama_3_1_70b_versatile.

pub const llama3_8b_8192: String

Llama 3 8B — legacy model, 8k context. Prefer llama_3_1_8b_instant.

pub const llama_3_1_70b_versatile: String

Llama 3.1 70B — powerful, 128k context.

pub const llama_3_1_8b_instant: String

Llama 3.1 8B — fast, lightweight, 128k context. Good for high-throughput workloads.

pub const llama_3_2_11b_vision_preview: String

Llama 3.2 11B Vision — multimodal model supporting image inputs.

pub const llama_3_2_1b_preview: String

Llama 3.2 1B — ultra-lightweight model for edge use cases.

pub const llama_3_2_3b_preview: String

Llama 3.2 3B — small but capable.

pub const llama_3_2_90b_vision_preview: String

Llama 3.2 90B Vision — large multimodal model.

pub const llama_3_3_70b_specdec: String

Llama 3.3 70B with speculative decoding for faster output.

pub const llama_3_3_70b_versatile: String

Llama 3.3 70B — latest generation, 128k context. Recommended default.

pub const mixtral_8x7b_32768: String

Mixtral 8x7B MoE — 32k context. Strong at reasoning and code.

pub const qwen_qwq_32b: String

Qwen QwQ 32B — strong reasoning and long-context model.
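
Several descriptions above mark a constant as legacy and name a preferred replacement. A minimal sketch of how calling code might apply those recommendations follows. The `preferred` helper is hypothetical and not part of gloq; it assumes only the constants listed on this page and `list.key_find` from gleam_stdlib.

```gleam
import gleam/list
import gloq/models

/// Hypothetical helper (not provided by gloq): return the replacement
/// suggested in the descriptions above for a legacy model, or the
/// input unchanged when no replacement is listed.
pub fn preferred(model: String) -> String {
  // Pairs of #(legacy constant, suggested replacement), taken from the
  // "Prefer ..." notes on this page.
  let replacements = [
    #(models.gemma_7b_it, models.gemma2_9b_it),
    #(models.llama3_70b_8192, models.llama_3_1_70b_versatile),
    #(models.llama3_8b_8192, models.llama_3_1_8b_instant),
  ]

  case list.key_find(replacements, model) {
    Ok(replacement) -> replacement
    Error(Nil) -> model
  }
}
```

Because every constant is a plain String, `preferred(models.llama3_8b_8192)` evaluates to `models.llama_3_1_8b_instant`, while a constant with no listed replacement, such as `models.llama_3_3_70b_versatile`, is returned as is.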
