Introducing the Technology Innovation Institute’s Falcon 3: Making Advanced AI Accessible and Available to Everyone, Everywhere

Experience unmatched performance and scalability on lightweight devices such as laptops and on energy-constrained infrastructure.

About Falcon 3

Revolutionizing AI for All

Falcon 3 has been meticulously designed to bring multimodal capabilities to open-source AI.

The addition of multimodal capabilities (image, video, and audio) further elevates the Falcon 3 family, pushing the limits of open-source AI in both performance and usability. As an open-source large language model (LLM), Falcon 3 is designed to democratize advanced AI by combining outstanding performance with the ability to run on lightweight devices, including laptops.

Released under TII’s Falcon License 2.0, Falcon 3 is a pioneering step toward making advanced AI tools available to all.

Performance Benchmark

Falcon3 - Vision

Falcon3 - Video

Falcon3 - Audio

New multimodal functionalities: Falcon 3 Vision, Video, and Audio

Falcon 3 now offers new multimodal capabilities, processing not only text but also images and, for the first time in the Falcon series, video and audio. These enhancements open new possibilities for media analysis and interactive user experiences.

With image processing, Falcon 3 excels in object recognition, scene description, and visual chart interpretation, and it outperforms other open-source models on most standard vision benchmarks. Its video processing capabilities let users analyze and extract insights from dynamic content, with features such as video content summarization and question answering on video streams or recordings up to one hour long. Audio capabilities enable content analysis and understanding for speech, sound, and music, including speech transcription, summarization, and acoustic pattern recognition.

All outputs from Falcon 3 Vision, Video, and Audio are provided in text format, which keeps them usable across scenarios and enables ecosystem integration and potential model cooperation. The multimodal version of Falcon 3 currently supports English for processing audio, video, and image data. By redefining multimodality, Falcon 3 broadens the horizon of what large language models can achieve, setting a new benchmark for versatility and innovation across industries.

Vision: Falcon 3’s image base models have a vocabulary size of 131K, a 32K context, GQA, and a Llama-compatible architecture. They offer fast inference (30 tokens/s), low latency (3.3 s), and low memory consumption (30.23 GB), all with exceptional zero-shot and few-shot performance on the open leaderboards.

Video: Our video-language base model, with visual and language decoder, uses a 131K vocabulary, a 32K context, and GQA, and has a Llama-compatible architecture. Our state-of-the-art 10B model and competitive 7B model both perform exceptionally well compared with similar-sized open models.

Audio: Our Falcon3 Audio models have displayed exceptional performance across speech, music, and mixed audio. The 7B model ranks second overall, outperforming larger models like SALMONN (13B) in key metrics. Even our lightweight models (3B and 1B) deliver competitive results, making Falcon3 Audio an outstanding choice for diverse audio applications, from sound analysis to speech recognition, while maintaining efficiency in the small model category.

Our Ambitions for Falcon 3

Democratized AI Access

Falcon 3 by TII offers models that are small, efficient, and capable of running on lightweight infrastructure. It delivers high performance without requiring extensive computational resources.

High Accessibility & Performance

Designed for developers, researchers, and businesses, Falcon 3 empowers users to leverage cutting-edge AI tools while maintaining ease of use and accessibility.

Access to State-of-the-Art Multimodal Capabilities in AI

Falcon 3 models now feature image, video, and audio analysis and understanding with exceptional performance, offering advanced AI capabilities for the open-source AI community.

Improved Efficiency & Fine-Tuning

Falcon 3 builds on the success of Falcon 2, delivering enhanced reasoning, fine-tuning capabilities, and improved efficiency across a wide range of use cases.

Commitment to Innovation

Reinforcing the Technology Innovation Institute’s (TII) mission, Falcon 3 fosters inclusive, open-source innovation, providing the global community with state-of-the-art AI models.

Model Architecture

Optimized Decoder-Only Design

Falcon 3’s architecture is a decoder-only design that uses Flash Attention 2 together with Grouped Query Attention (GQA). GQA shares key and value heads across groups of query heads, which minimizes the memory needed for the Key-Value (KV) cache during inference and makes operations faster and more efficient.
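To make the KV-cache saving concrete, here is a minimal sketch of the usual cache-size arithmetic. The layer count, head counts, and head dimension below are illustrative assumptions, not published Falcon 3 values, and the `AttentionConfig` and `kv_cache_bytes` names are hypothetical helpers introduced only for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AttentionConfig:
    """Hypothetical attention shape; these are not published Falcon 3 values."""
    num_layers: int
    num_query_heads: int
    num_kv_heads: int  # equals num_query_heads for standard multi-head attention
    head_dim: int


def kv_cache_bytes(cfg: AttentionConfig, seq_len: int, batch_size: int = 1,
                   bytes_per_value: int = 2) -> int:
    """Bytes needed to cache keys and values for seq_len tokens.

    The factor 2 accounts for storing both a key and a value tensor per layer.
    bytes_per_value=2 assumes 16-bit storage (fp16 or bf16).
    """
    if cfg.num_query_heads % cfg.num_kv_heads != 0:
        raise ValueError("query heads must be divisible by KV heads")
    return (2 * cfg.num_layers * cfg.num_kv_heads * cfg.head_dim
            * seq_len * batch_size * bytes_per_value)


# Illustrative shapes only (assumed, not taken from the Falcon 3 model cards).
mha = AttentionConfig(num_layers=32, num_query_heads=32, num_kv_heads=32, head_dim=128)
gqa = AttentionConfig(num_layers=32, num_query_heads=32, num_kv_heads=8, head_dim=128)

context = 32 * 1024  # the 32K context size mentioned on this page
mha_gib = kv_cache_bytes(mha, context) / 2**30
gqa_gib = kv_cache_bytes(gqa, context) / 2**30
print(f"MHA: {mha_gib:.1f} GiB, GQA: {gqa_gib:.1f} GiB")
```

With these assumed shapes, sharing each key-value head across four query heads cuts the cache to a quarter of the multi-head size. The real ratio depends on the actual head counts of each model.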

Advanced Tokenization

With a tokenizer supporting a large vocabulary of 131K tokens, double that of Falcon 2, Falcon 3 offers superior compression and improved downstream performance, enhancing its ability to handle diverse tasks.

Enhanced Long-Context Training

Trained natively with a 32K context size, Falcon 3 demonstrates exceptional long-context capabilities, delivering enhanced performance for extended input data compared to its predecessors.
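In practice, a fixed context size means the prompt and the generated reply share one token budget. The sketch below shows one way to reserve room for the reply and split an over-long input into chunks that fit. It assumes "32K" means 32,768 tokens, and `prompt_budget` and `split_into_windows` are hypothetical helper names, not part of any Falcon tooling.

```python
CONTEXT_WINDOW = 32 * 1024  # assumes "32K" means 32,768 tokens


def prompt_budget(max_new_tokens: int, context_window: int = CONTEXT_WINDOW) -> int:
    """Tokens left for the prompt after reserving room for the generated reply."""
    if not 0 <= max_new_tokens < context_window:
        raise ValueError("max_new_tokens must be in [0, context_window)")
    return context_window - max_new_tokens


def split_into_windows(token_ids: list[int], max_new_tokens: int,
                       context_window: int = CONTEXT_WINDOW) -> list[list[int]]:
    """Split a long token sequence into consecutive chunks that each fit the budget."""
    budget = prompt_budget(max_new_tokens, context_window)
    return [token_ids[i:i + budget] for i in range(0, len(token_ids), budget)]


# Stand-in token IDs; a real tokenizer would produce these from text.
document = list(range(70_000))
chunks = split_into_windows(document, max_new_tokens=1_024)
print([len(c) for c in chunks])  # [31744, 31744, 6512]
```

Plain consecutive chunking is the simplest choice; overlapping windows or summarizing earlier chunks are common alternatives when context must carry across chunk boundaries.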

High-Performing Multimodal Models

Falcon 3 Vision, Video, and Audio all provide modality-to-text capabilities, enabling ecosystem integration and potential model cooperation. The multimodal version of Falcon 3 supports English for processing audio, video, and image data.

The Falcon 3 series represents a huge leap forward in AI technology. Falcon 3 was trained on 14 trillion tokens, more than double the training data of its predecessor, Falcon 180B, ensuring a significant boost in performance and capability. The initial training was followed by multiple stages that improved reasoning and math performance with high-quality data and extended the context with natively long-context data. Falcon 3 was trained on four main languages (English, Spanish, Portuguese, and French) to ensure much higher learning capability and quality for those languages. The inclusion of multimodal capabilities advances Falcon 3 further, offering enhanced support to the open-source community.
