Falcon H1 7B Instruct

Falcon H1 7B deployment guidance for Arabic-heavy production use.

1. What it is

Falcon H1 7B Instruct (`tiiuae/Falcon-H1-7B-Instruct`) is a 7B instruction-tuned model from TII UAE.

2. What it does

It provides balanced Arabic and English chat/instruction performance with strong throughput for API serving.

3. How it compares

Versus ALLaM 7B: Falcon H1 can be a good balance choice when you need bilingual coverage with strong speed.
Versus JAIS 13B: lower memory cost and generally lower latency, but JAIS may outperform on high-precision Arabic enterprise prompts.

4. Best for on DCP

Bilingual assistants with high request volume
Regional support bots
Arabic-first copilots where 24 GB GPUs are available

5. Hardware requirements on DCP

DCP floor: `min_vram_gb: 24` (Tier A)
Recommended providers: RTX 4090, A100 40GB+
Template: `vllm-serve`

6. How to run on DCP

Use `job_type: "vllm_serve"` with `params.model: "tiiuae/Falcon-H1-7B-Instruct"`.
Set `gpu_requirements.min_vram_gb` to `24`.
Consume the OpenAI-compatible endpoint returned by job output.

7. Licensing and commercial-use notes

Falcon models use TII model license terms. Validate your intended commercial use (hosting, redistribution, derivatives) against the card license text.

Sources:

https://huggingface.co/tiiuae/Falcon-H1-7B-Instruct
infra/config/arabic-portfolio.json
backend/src/db.js

Docs