Grok-4.1
by xAI
Grok 4.1 is a new model featuring more natural, fluid dialogue while maintaining strong core reasoning capabilities. It is publicly available through our web and mobile consumer apps. As an update to Grok 4 and Grok 3, we engage in pre-deployment safety testing largely similar to that described in the Grok 4 model card. In line with our Risk Management Framework (RMF), we measure safety-relevant behaviors across three categories: abuse potential, concerning propensities, and dual-use capabilities. This report describes our evaluation methodology, results, and mitigations for these behaviors. Grok 4.1 is available in two configurations: Grok 4.1 Non-Thinking (Grok 4.1 NT), which responds directly, and Grok 4.1 Thinking (Grok 4.1 T), which reasons before responding. We evaluate both configurations with our production system prompt. We also deploy these models with safeguards which we describe and evaluate in this report, including a new and more robust input filter model. Finally, we discuss our dual-use capability evaluations. Model card: https://data.x.ai/2025-11-17-grok-4-1-model-card.pdf
Potential Risks
1 considerations identified
Review recommended before use
These considerations are automatically identified based on publicly available information about the vendor and AI catalog data. Actual risks may vary based on your specific use case and implementation.
EU Alternatives
Discover EU-based alternatives for this AI application.
Ready to manage AI applications?
Track, assess, and govern your AI applications with Anove.