gloq/models
Values
pub const deepseek_r1_distill_llama_70b: String
DeepSeek R1 distilled on Llama 70B — strong reasoning model.
pub const gemma2_9b_it: String
Gemma 2 9B — Google’s efficient instruction-tuned model.
pub const llama3_70b_8192: String
Llama 3 70B — legacy model, 8k context. Prefer llama_3_1_70b_versatile.
pub const llama3_8b_8192: String
Llama 3 8B — legacy model, 8k context. Prefer llama_3_1_8b_instant.
pub const llama_3_1_70b_versatile: String
Llama 3.1 70B — powerful, 128k context.
pub const llama_3_1_8b_instant: String
Llama 3.1 8B — fast, lightweight, 128k context. Good for high-throughput workloads.
pub const llama_3_2_11b_vision_preview: String
Llama 3.2 11B Vision — multimodal model supporting image inputs.
pub const llama_3_2_1b_preview: String
Llama 3.2 1B — ultra-lightweight model for edge use cases.
pub const llama_3_2_90b_vision_preview: String
Llama 3.2 90B Vision — large multimodal model.
pub const llama_3_3_70b_specdec: String
Llama 3.3 70B with speculative decoding for faster output.
pub const llama_3_3_70b_versatile: String
Llama 3.3 70B — latest generation, 128k context. Recommended default.
pub const mixtral_8x7b_32768: String
Mixtral 8x7B MoE — 32k context. Strong at reasoning and code.
pub const qwen_qwq_32b: String
Qwen QwQ 32B — strong reasoning and long-context model.
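
The constants above are plain String values, so they can be passed anywhere a model identifier is expected. As a minimal sketch, assuming the module is imported as gloq/models, a small helper might pick a model per task. The Task type and the model_for function are hypothetical names introduced only for this illustration and are not part of gloq; the mapping simply follows the descriptions listed above.

```gleam
import gloq/models

/// Hypothetical task categories, defined only for this example.
pub type Task {
  General
  HighThroughput
  Reasoning
  Vision
}

/// Return the model constant whose description above best matches the task.
pub fn model_for(task: Task) -> String {
  case task {
    // Described above as the recommended default.
    General -> models.llama_3_3_70b_versatile
    // Described above as fast and lightweight.
    HighThroughput -> models.llama_3_1_8b_instant
    // Described above as a strong reasoning model.
    Reasoning -> models.deepseek_r1_distill_llama_70b
    // Described above as supporting image inputs.
    Vision -> models.llama_3_2_11b_vision_preview
  }
}
```

Keeping the choice in one function means a later switch away from a legacy or preview model touches a single case branch rather than every call site.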