LifeArchitect.ai Models Table (Free Preview for Hugging Face 🤗)

Browse 900+ AI models with parameters, benchmarks, and metadata. Click column headers to sort.

███ = redacted for free preview. Upgrade to Models Table Pro for full access to all cells plus additional columns (training hardware, compute estimates, training cost, and more). All analysis by LifeArchitect.ai.


NVIDIA-Nemotron-Labs-3-Elastic-30B-A3B	Shanghai AI Laboratory/Se...	https://www.linkedin.com/blog/engineering/generative-ai/how-we-built-domain-adapted-foundation-genai-models-to-power-our-platform	0.0000119	0.00469	MatFormer	250,000	80,000:1	███	0.000	41.318	48.85	26.26	18.07	reddit outbound, web, dialogue	Jun/2026	███	███	https://www.prnewswire.com/news-releases/baidu-launches-ernie-4-5-turbo-ernie-x1-turbo-and-new-suite-of-ai-tools-to-empower-developers-and-supercharge-ai-innovation-302438584.html	Reasoning, Diffusion	Llama 2 for Southeast Asian (SEA) languages: Vietnamese 🇻🇳, Indonesian 🇮🇩, Thai ...	886	███	███	███	███	Non-commercial research	1,000,000	International	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███


Glimmer-1-Base	Glint Research	https://huggingface.co/Glint-Research/Glimmer-1-Base	0.0000119		Dense	0	43:1	███	0.000					web-scale	Jun/2026	🟢	A	███		11.9K-parameter (0.0000119B) experimental micro-model trained on 500K tokens of ...	886	███	███	███	███	MIT	███		███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███
GLM-5.2	Z.AI	https://huggingface.co/zai-org/GLM-5.2	744	40	MoE	28,500	39:1	███	15.3	███		91.2	54.7	synthetic, web-scale	Jun/2026	🟢	A	https://arxiv.org/abs/2602.15763	Reasoning	1M-token context (up from 200K in GLM-5.1); 131K max output. Trained entirely on...	885	███	███	███	███	███	1,000,000	China	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███
VibeThinker-3B	WeiboAI (Sina Weibo)	███	3		Dense	5,500	1,834:1	███	0.4			70.2		synthetic, web-scale	Jun/2026	🟢	C	https://arxiv.org/abs/2606.16140	Reasoning	3B dense reasoning model scoring 94.3 on AIME26 and 80.2 on LiveCodeBench v6; ma...	884	███	███	███	███	MIT	███	China	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███
Rio-3.5-Open-397B	IplanRIO	https://huggingface.co/prefeitura-rio/Rio-3.5-Open-397B	397	17	MoE	███	91:1	███	12.6		88	90.9	36.5	synthetic, web-scale	Jun/2026	🟢	C	https://huggingface.co/prefeitura-rio/Rio-3.5-Open-397B	Reasoning	Merge of Qwen 3.5 397B + Next-N2-Pro. IplanRIO is Rio de Janeiro municipal IT co...	883	███	███	███	███	MIT	1,010,000	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███
openPangu 2.0 Pro	Huawei	pending 30/jun	505	18	MoE	19,000	38:1	███	10.3					synthetic, web-scale	Jun/2026	███	C	https://gitcode.com/ascend-tribe		Open-source MoE with record 28:1 sparsity ratio. DSA+SWA hybrid attention archit...	882	███	███	███	███	Other	███	China	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███
Kimi-K2.7-Code	Moonshot AI	https://huggingface.co/moonshotai/Kimi-K2.7-Code	1000	32	MoE	30,500	31:1	███	18.4					synthetic, web-scale	Jun/2026	🟢	A	https://huggingface.co/moonshotai/Kimi-K2.7-Code	Reasoning	Coding-focused agentic model built upon Kimi K2.6. Reduces thinking-token usage ...	███	███	███	███	███	Other	262,144	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███
Nex-N2-Pro	Nex AGI	https://huggingface.co/nex-agi/Nex-N2-Pro	397	17	███	36,000	91:1	███	12.6			90.7		synthetic, web-scale	Jun/2026	🟢	C	https://github.com/nex-agi/Nex-N2	Reasoning	Post-trained on Qwen3.5-397B-A17B. "An agentic model with Agentic Thinking." GPQ...	880	███	███	███	███	Apache 2.0	███	China	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███
DiffusionGemma 26B A4B IT	Google DeepMind	https://huggingface.co/google/diffusiongemma-26B-A4B-it	25.2	3.8	MoE	14,000	556:1	███	2.0		77.6	73.2	11.9	web-scale	Jun/2026	🟢	C	https://huggingface.co/google/diffusiongemma-26B-A4B-it	Reasoning, Diffusion	███	879	███	███	███	███	███	256,000	USA	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███
Apodex-1.0-H	Apodex AI	https://apodex.ai	███	17	MoE	36,000	91:1	███	12.6				60.8	synthetic, web-scale	Jun/2026	🟢	D	https://www.apodex.com/blog/apodex-1.0	Reasoning	Verification-centric deep-research agent team on Qwen3.5 base. Heavy-duty mode c...	878	███	███	███	███	███	262,144	USA	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███
Claude Fable 5	Anthropic	https://claude.ai/	6000	400	MoE	250,000	42:1	███	129.1			94.1	64.5	synthetic, web-scale	Jun/2026	🟢	D	███	Reasoning, SOTA	Mythos-class model made safe for general use. Same underlying model as Claude My...	877	███	███	███	███	Proprietary	200,000	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███
North-Mini-Code-1.0	Cohere	https://huggingface.co/CohereLabs/North-Mini-Code-1.0	███	3	MoE	12,000	400:1	███	2.0					synthetic, web-scale	Jun/2026	🟢	C	https://huggingface.co/blog/CohereLabs/introducing-north-mini-code		30B-A3B MoE (128 experts, 8 active per token) optimized for agentic software eng...	876	███	███	███	███	███	256,000	Canada	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███
AFM 3 Core Advanced	Apple	https://machinelearning.apple.com/research/introducing-third-generation-of-apple-foundation-models	20	4	MoE	25,000	1,250:1	███	2.4					synthetic, web-scale	Jun/2026	🟢	███	https://machinelearning.apple.com/research/introducing-third-generation-of-apple-foundation-models		Most powerful Apple on-device model. 20B params stored in flash (NAND); 1–4B act...	875	███	███	███	███	Proprietary	███	USA	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███
AFM 3 Cloud Pro	Apple	https://machinelearning.apple.com/research/introducing-third-generation-of-apple-foundation-models	1200	60	MoE	60,000	50:1	███	28.3	███				synthetic, web-scale	Jun/2026	🟢	D	https://machinelearning.apple.com/research/introducing-third-generation-of-apple-foundation-models		Apple–Google–NVIDIA collaboration. Based on custom 1.2T-parameter Gemini model w...	874	███	███	███	███	███		USA	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███
Macaron-V1-Preview-749B	Mind Lab	https://macaron-model-previews.macaron.im/	749	41	███	28,500	39:1	███	15.4					synthetic, web-scale	Jun/2026	🟢	A	https://macaron.im/mindlab/research/macaron-v1-preview		749B Mixture-of-LoRA agent model post-trained from GLM-5.1 (744B frozen base + 5...	873	███	███	███	███	MIT	202,752	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███
Gemma 4 12B	Google DeepMind	https://huggingface.co/google/gemma-4-12B-it	12		Dense	14,000	1,167:1	███	1.4		77.2	78.8	5.2	███	Jun/2026	🟢	C	https://huggingface.co/google/gemma-4-12B-it	Reasoning	Encoder-free multimodal (text, image, audio) dense model with configurable think...	872	███	███	███	███	███	256,000	USA	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███
Aion-1.0-Plan	Microsoft		14		Dense	14,000	███	███	1.5					synthetic, web-scale	Jun/2026	🟢	D	https://blogs.windows.com/windowsdeveloper/2026/06/02/build-2026-furthering-windows-as-the-trusted-platform-for-development/	Reasoning	On-device reasoning and tool-calling SLM that ships in-box as part of Windows on...	871	███	███	███	███	Proprietary	32,000	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███
Aion-1.0-Instruct	Microsoft	https://microsoftedge.github.io/Demos/built-in-ai/playgrounds/prompt-api/	2		Dense	8,000	4,000:1	███	0.4					███	Jun/2026	🟢	D	https://blogs.windows.com/msedgedev/2026/06/02/expanding-on-device-ai-in-microsoft-edge-new-models-and-apis-for-the-web/		Pre-release small language model for on-device AI in Microsoft Edge (Canary/Dev)...	870	███	███	███	███	MIT	███	USA	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███
MAI-Code-1-Flash	Microsoft	https://github.blog/changelog/2026-06-02-mai-code-1-flash-is-now-available-for-github-copilot/	30		Dense	15,000	500:1	███	2.2					web-scale	Jun/2026	🟢	D	https://microsoft.ai/news/introducingmai-code-1-flash/	███	Lightweight agentic coding model from Microsoft AI, built end-to-end on clean an...	869	███	███	███	███	███		USA	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███
MAI-Thinking-1	Microsoft	███	1000	35	MoE	33,500	34:1	███	19.3		85	84.2		web-scale	Jun/2026	🟢	A	https://microsoft.ai/wp-content/uploads/2026/06/main_20260602_2.pdf	Reasoning	Microsoft AI's reasoning model. 35B-active, ~1T-total parameters sparse MoE. Tra...	868	███	███	███	███	Proprietary	███	USA	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███
KeyLM-75M-Instruct	Independent	https://huggingface.co/Eclipse-Senpai/KeyLM-75M-Instruct	0.0753		Dense	18	240:1	███	0.004	24				web-scale	Jun/2026	🟢	A	https://huggingface.co/Eclipse-Senpai/KeyLM-75M-Instruct	███	75M-param from-scratch small LM; competitive on IFEval vs SmolLM-135M-Instruct a...	867	███	███	███	███	Apache 2.0	2,048	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███
Cosmos 3 Super	NVIDIA	https://huggingface.co/nvidia/Cosmos3-Super	64	32	MoE	200	4:1	███	0.4					special	Jun/2026	🟢	███	https://research.nvidia.com/labs/cosmos-lab/cosmos3/technical-report.pdf	SOTA	Omnimodal world model for Physical AI; dual-tower mixture-of-transformers (reaso...	866	███	███	███	███	███	262,144	USA	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███
Mellum2-12B-A2.5B-Thinking	JetBrains	https://huggingface.co/JetBrains/Mellum2-12B-A2.5B-Thinking	12	2.5	MoE	10,600	884:1	███	1.2			57.6		███	Jun/2026	🟢	A	https://arxiv.org/abs/2605.31268	Reasoning	Open-weight 12B MoE (64 experts, 8 active) language model specialised in softwar...	865	███	███	███	███	Apache 2.0	███	USA	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███
Qwen3.7-Plus	Alibaba	https://chat.qwen.ai/	480	35	MoE	36,000	75:1	███	13.9		88.5	90.3	34.7	synthetic, web-scale	Jun/2026	🟢	D	███	Reasoning	Multimodal agent model unifying vision and language; operates GUI and CLI within...	864	███	███	███	███	Proprietary	███	China	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███
Nemotron 3 Ultra	NVIDIA	https://huggingface.co/nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16#nvidia-nemotron-3-ultra-550b-a55b-bf16	███	55	MoE	25,000	46:1	███	12.4	89.1	86.8	87	37.4	synthetic, web-scale	Jun/2026	🟢	C	https://arxiv.org/abs/2512.20856		NVIDIA’s largest open model: 550B total parameters with up to 55B active per tok...	863	███	███	███	███	Other	███	USA	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███
MiniMax-M3	MiniMax	https://huggingface.co/MiniMaxAI/MiniMax-M3	428	23	MoE	100,000	███	███	21.8					synthetic, web-scale	Jun/2026	🟢	C	https://www.minimax.io/blog/minimax-m3	Reasoning, SOTA	"M3 is a model that has undergone mixed-modality training from Step 0... After r...	862	███	███	███	███	Proprietary	1,000,000	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███
Step 3.7 Flash	StepFun	https://huggingface.co/stepfun-ai/Step-3.7-Flash	198	11	MoE	24,000	122:1	███	7.3				49.7	synthetic, web-scale	May/2026	🟢	A	███	Reasoning	A high-efficiency Flash model for real-world agents.	861	███	███	███	███	Apache 2.0	███	China	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███
LFM2.5-8B-A1B	Liquid AI	https://huggingface.co/LiquidAI/LFM2.5-8B-A1B	8.3	1.5	MoE	38,000	4,579:1	███	███					synthetic, web-scale	May/2026	🟢	A	https://www.liquid.ai/blog/lfm2-5-8b-a1b	Reasoning	Edge MoE for fast on-device tool calling; 128K context, reasoning-only model. Hi...	860	███	███	███	███	Other	███	USA	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███
Claude Opus 4.8	Anthropic	https://claude.ai/	███	250	MoE	80,000	16:1	███	66.7			93.6	57.9	synthetic, web-scale	May/2026	🟢	D	https://www.anthropic.com/claude-opus-4-8-system-card	Reasoning, SOTA	Announce: https://www.anthropic.com/news/claude-opus-4-8 HLE=with tools (49.8 no...	859	███	███	███	███	███	200,000	USA	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███
ESMC 6B	Biohub	https://huggingface.co/biohub/ESMC-6B	6		███	6,600	1,100:1	███	0.7					special	May/2026	🟢	A	https://biohub.ai/papers/esm_protein.pdf		“Language Modeling Materializes a World Model of Protein Biology”. Protein langu...	858	███	███	███	███	Non-commercial research	2,000	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███
MiniCPM5-1B	OpenBMB	https://huggingface.co/spaces/openbmb/MiniCPM5-1B-Demo	1.08		Dense	8,000	███	███	0.3		48.85	26.26		synthetic, web-scale	May/2026	🟢	A	https://huggingface.co/openbmb/MiniCPM5-1B	Reasoning	the first model in the MiniCPM5 series. It is a dense 1B Transformer built for o...	857	███	███	███	███	Apache 2.0	32,768	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███
Gated DeltaNet-2	NVIDIA	https://github.com/NVlabs/GatedDeltaNet-2	1.3		Dense	100	77:1	███	0.04					web-scale	May/2026	🟢	A	https://github.com/NVlabs/GatedDeltaNet-2/blob/main/paper/GDN2_paper.pdf		███	856	███	███	███	███	Other	4,000	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███
Command A+	Cohere	https://huggingface.co/CohereLabs/command-a-plus-05-2026-bf16	218	25	MoE	20,000	92:1	███	7.0					synthetic, web-scale	May/2026	███	C	https://huggingface.co/CohereLabs/command-a-plus-05-2026-bf16		"open source model with 25 billion active parameters and 218B total parameters m...	855	███	███	███	███	Apache 2.0	131,072	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███
Qwen3.7-Max	Alibaba	https://chat.qwen.ai/	2000	100	MoE	40,000	20:1	███	29.8		89.6	92.4	53.5	synthetic, web-scale	May/2026	🟢	D	https://qwen.ai/blog?id=qwen3.7	Reasoning	"Qwen3.7-Max, our latest proprietary model designed for the agent era." 35-hour ...	███	███	███	███	███	███	131,072	China	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███
HRM-Text-1B	Sapient Intelligence	https://huggingface.co/sapientinc/HRM-Text-1B	1		Dense	160	160:1	███	0.04	60.7				synthetic, web-scale	May/2026	🟢	A	https://github.com/sapientinc/HRM-Text	███	"1B text generation model based on the HRM architecture, strengthened by task co...	853	███	███	███	███	███	4,096	Singapore	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███	███