Falcon H1 7B Instruct
Falcon H1 7B deployment guidance for Arabic-heavy production use.
1. What it is
Falcon H1 7B Instruct (`tiiuae/Falcon-H1-7B-Instruct`) is a 7B instruction-tuned model from TII UAE.
2. What it does
It provides balanced Arabic and English chat/instruction performance with strong throughput for API serving.
3. How it compares
- Versus ALLaM 7B: Falcon H1 can be a good balance choice when you need bilingual coverage with strong speed.
- Versus JAIS 13B: lower memory cost and generally lower latency, but JAIS may outperform on high-precision Arabic enterprise prompts.
4. Best for on DCP
- Bilingual assistants with high request volume
- Regional support bots
- Arabic-first copilots where 24 GB GPUs are available
5. Hardware requirements on DCP
- DCP floor: `min_vram_gb: 24` (Tier A)
- Recommended providers: RTX 4090, A100 40GB+
- Template: `vllm-serve`
6. How to run on DCP
- Use `job_type: "vllm_serve"` with `params.model: "tiiuae/Falcon-H1-7B-Instruct"`.
- Set `gpu_requirements.min_vram_gb` to `24`.
- Consume the OpenAI-compatible endpoint returned by job output.
7. Licensing and commercial-use notes
Falcon models use TII model license terms. Validate your intended commercial use (hosting, redistribution, derivatives) against the card license text.
Sources:
- https://huggingface.co/tiiuae/Falcon-H1-7B-Instruct
- /home/node/dc1-platform/infra/config/arabic-portfolio.json
- /home/node/dc1-platform/backend/src/db.js