The predominance of English and Latin-based large language models (LLMs) has led to a notable deficit in native arabic llms. This discrepancy is accentuated by the prevalent inclusion of English tokens in existing Arabic models, detracting from their efficacy in processing native Arabi