TinyLLaMA is an open-source language model from jzhang38's team, designed as a lightweight yet capable alternative to larger LLaMA models. Its 1.1 B base model is trained on a corpus of 3 trillion tokens and follows the original LLaMA architecture and tokenizer. The project includes fully reproducible checkpoints, a chat-finetuned variant, and shared evaluation benchmarks. As a lightweight model, TinyLLaMA serves as a practical alternative to larger models such as LLaMA‑3.1 or GPT‑NeoX when computational resources are limited, while retaining strong performance for its size.
Key features include:
- 1.1 B parameter model pretrained with the LLaMA architecture on 3 T tokens
- Fully open artifacts: code, training checkpoints, data, and evaluation logs
- Chat-finetuned version available for dialogue applications
- Apache 2.0 license, permitting commercial use
- Plug-and-play compatibility with LLaMA ecosystem tools and pipelines
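Because TinyLLaMA reuses the LLaMA architecture and tokenizer, it loads through standard LLaMA-compatible tooling. Below is a minimal sketch using Hugging Face `transformers`; the checkpoint name `TinyLlama/TinyLlama-1.1B-Chat-v1.0` and the generation settings are assumptions for illustration rather than the project's prescribed usage.

```python
# Minimal sketch: load the chat-finetuned TinyLLaMA via Hugging Face transformers.
# The Hub identifier below is an assumption; substitute the checkpoint you actually use.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "TinyLlama/TinyLlama-1.1B-Chat-v1.0"  # assumed checkpoint name
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # half precision keeps the memory footprint small
    device_map="auto",
)

# The chat variant ships a chat template, so role-based messages can be used directly.
messages = [
    {"role": "user", "content": "Explain what a 1.1B-parameter model is good for."},
]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=128, do_sample=True, temperature=0.7)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```

The same snippet works unchanged with other LLaMA-family checkpoints, which is what "plug-and-play compatibility" amounts to in practice.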
Use cases include:
- Deploying efficient LLMs on edge or constrained hardware (e.g., a ~637 MB 4‑bit quantized model; see the quantized-loading sketch after this list)
- Research and benchmarking on compact LLaMA‑style models
- Integration into chatbots, assistant tools, or on-device NLP systems
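For the constrained-hardware case, one common route is 4-bit quantization at load time. The sketch below uses `bitsandbytes` through `transformers`; the checkpoint name and quantization settings are assumptions, not the project's official deployment recipe (on-device setups often use llama.cpp/GGUF instead).

```python
# Rough sketch: load TinyLLaMA with 4-bit weight quantization for low-memory deployment.
# Assumes bitsandbytes is installed and the same (assumed) Hub checkpoint name as above.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

model_id = "TinyLlama/TinyLlama-1.1B-Chat-v1.0"  # assumed checkpoint name

quant_config = BitsAndBytesConfig(
    load_in_4bit=True,                     # quantize weights to 4-bit at load time
    bnb_4bit_quant_type="nf4",             # NormalFloat4 quantization
    bnb_4bit_compute_dtype=torch.float16,  # compute in fp16 for speed
)

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=quant_config,
    device_map="auto",
)

prompt = "Summarize the benefits of small language models in one sentence."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```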