BGE Reranker v2 M3
Reranker model guide for Arabic retrieval quality uplift on DCP.
1. What it is
BGE Reranker v2 M3 (`BAAI/bge-reranker-v2-m3`) is a multilingual cross-encoder reranker from BAAI.
2. What it does
It reranks candidate documents after retrieval to improve top-k precision in search and RAG.
3. How it compares
- Versus BGE-M3 embeddings: reranker is slower but higher quality for final ranking.
- Versus using only ANN retrieval: reranker usually improves factual relevance on difficult Arabic queries.
4. Best for on DCP
- Second-stage ranking in Arabic RAG
- Enterprise search relevance tuning
- High-quality retrieval for legal/finance KBs
5. Hardware requirements on DCP
- DCP floor: `min_vram_gb: 8` (Tier B)
- Recommended providers: 8 GB+ GPUs
- Template: `arabic-reranker`
6. How to run on DCP
- Start with template `arabic-reranker`.
- Use default model `BAAI/bge-reranker-v2-m3`.
- Adjust `TOP_K` and `MAX_SEQ_LEN` in template env vars based on workload.
7. Licensing and commercial-use notes
BGE reranker model card uses MIT. Commercial usage is usually allowed, but verify your compliance obligations for dataset and downstream application policy.
Sources:
- https://huggingface.co/BAAI/bge-reranker-v2-m3
- /home/node/dc1-platform/docker-templates/arabic-reranker.json
- /home/node/dc1-platform/infra/config/arabic-portfolio.json