Prerequisites
Before you begin, ensure you have the following:
- Python version:
  - Python 3.11 recommended (compatible with 3.11–3.13)
  - Windows ARM64: the ARM64 build of Python 3.11 is required
- Conda/Micromamba:
  - Miniconda (macOS, Linux, Windows x64)
  - Micromamba (Windows ARM64)
Step 1: Install Conda or Micromamba
It’s a best practice to create a dedicated environment for each project.
For macOS, Linux, and Windows (x64)
Check if you have conda installed:
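For example (assuming conda would be on your PATH if installed):

```shell
# Prints the conda version if installed; a "command not found" error means it is not
conda --version
```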
If conda is not installed, download and install Miniconda or Anaconda from:
https://www.anaconda.com/download/success
For Windows ARM64
Check if you have micromamba installed:
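For example (assuming micromamba would be on your PATH if installed):

```shell
# Prints the micromamba version if installed; a "command not found" error means it is not
micromamba --version
```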
If micromamba is not installed, download and install it from:
https://mamba.readthedocs.io/en/latest/installation/micromamba-installation.html
Step 2: Create a New Environment
For macOS, Linux, and Windows (x64)
Create a new conda environment for your Nexa SDK project:
conda create -n nexa-env python=3.11
Activate the environment:
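The activation command, matching the environment name created above:

```shell
# Activate the newly created environment
conda activate nexa-env
```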
For Windows ARM64
Windows ARM64 requires a special setup with the win-arm64 platform and ARM64 Python:
- Create the environment with win-arm64 platform specification:
CONDA_SUBDIR=win-arm64 micromamba create -n nexa-env python=3.11 -c conda-forge
Or alternatively:
micromamba create -n nexa-env python=3.11 --platform win-arm64 -c conda-forge
- Activate the environment:
micromamba activate nexa-env
- Configure the environment to use win-arm64 packages:
micromamba config append subdirs win-arm64
For Windows ARM64, you must use the ARM64 version of Python 3.11 from conda-forge with the win-arm64 platform specification for Nexa SDK to work properly.
Step 3: Install Nexa SDK
Install the latest NexaAI Python SDK from PyPI. The installation command varies by platform:
macOS
For macOS, you need to install the MLX version:
pip install 'nexaai[mlx]'
Windows (x64)
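The install command for Windows x64 is not shown here. Based on the base package name used in the macOS command, it is likely the plain package without extras (treat this as an assumption and verify against the official Nexa SDK docs):

```shell
# Likely command for Windows x64 (assumed; verify against the official docs)
pip install nexaai
```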
Windows (ARM64)
Make sure you’ve completed Step 2 for Windows ARM64 to set up the win-arm64 environment before installing.
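With the win-arm64 environment from Step 2 activated, the install likely uses the same base package as the other platforms (an assumption; confirm in the Nexa SDK docs):

```shell
# Run inside the activated win-arm64 environment (assumed command; verify against the docs)
pip install nexaai
```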
Linux
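The Linux install command is also missing here; it is likely the plain package without the MLX extra (an assumption; confirm in the Nexa SDK docs):

```shell
# Likely command for Linux (assumed; verify against the official docs)
pip install nexaai
```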
Step 4: Authentication Setup
Before running any examples, you need to set up your NexaAI authentication token.
Set Token in Environment
Replace "YOUR_NEXA_TOKEN_HERE" with your actual NexaAI token from https://sdk.nexa.ai/:
- Linux/macOS:
export NEXA_TOKEN="YOUR_NEXA_TOKEN_HERE"
- Windows (PowerShell):
$env:NEXA_TOKEN="YOUR_NEXA_TOKEN_HERE"
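To confirm the token is visible to Python before running the examples (the variable name matches the export above; the check itself is just a sketch):

```python
import os

# Read the token from the environment, as set in the previous step
token = os.environ.get("NEXA_TOKEN")
if token:
    print("NEXA_TOKEN is set")
else:
    print("NEXA_TOKEN is missing - set it before running the examples")
```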
Step 5: Running Your First Model
Language Model (LLM)
from nexaai.llm import LLM, GenerationConfig
from nexaai.common import ModelConfig, ChatMessage
# Initialize model
model_path = "~/.cache/nexa.ai/nexa_sdk/models/Qwen/Qwen3-0.6B-GGUF/Qwen3-0.6B-Q8_0.gguf"
m_cfg = ModelConfig()
llm = LLM.from_(model_path, plugin_id="cpu_gpu", device_id="cpu", m_cfg=m_cfg)
# Create conversation
conversation = [ChatMessage(role="system", content="You are a helpful assistant.")]
conversation.append(ChatMessage(role="user", content="Hello, how are you?"))
# Apply chat template and generate
prompt = llm.apply_chat_template(conversation)
for token in llm.generate_stream(prompt, g_cfg=GenerationConfig(max_tokens=100)):
    print(token, end="", flush=True)
Multimodal Model (VLM)
from nexaai.vlm import VLM, GenerationConfig
from nexaai.common import ModelConfig, MultiModalMessage, MultiModalMessageContent
# Initialize model
model_path = "~/.cache/nexa.ai/nexa_sdk/models/NexaAI/gemma-3n-E4B-it-4bit-MLX/model-00001-of-00002.safetensors"
m_cfg = ModelConfig()
vlm = VLM.from_(name_or_path=model_path, m_cfg=m_cfg, plugin_id="cpu_gpu", device_id="")
# Create multimodal conversation
conversation = [MultiModalMessage(role="system",
    content=[MultiModalMessageContent(type="text", text="You are a helpful assistant.")])]
# Add user message with image
contents = [
    MultiModalMessageContent(type="text", text="Describe this image"),
    MultiModalMessageContent(type="image", text="path/to/image.jpg"),
]
conversation.append(MultiModalMessage(role="user", content=contents))
# Apply chat template and generate
prompt = vlm.apply_chat_template(conversation)
for token in vlm.generate_stream(prompt, g_cfg=GenerationConfig(max_tokens=100, image_paths=["path/to/image.jpg"])):
    print(token, end="", flush=True)
Embedder
from nexaai.embedder import Embedder, EmbeddingConfig
# Initialize embedder
model_path = "~/.cache/nexa.ai/nexa_sdk/models/NexaAI/jina-v2-fp16-mlx/model.safetensors"
embedder = Embedder.from_(name_or_path=model_path, plugin_id="cpu_gpu")
# Generate embeddings
texts = ["Hello world", "How are you?"]
config = EmbeddingConfig(batch_size=2)
embeddings = embedder.generate(texts=texts, config=config)
for text, embedding in zip(texts, embeddings):
    print(f"Text: {text}")
    print(f"Embedding dimension: {len(embedding)}")
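Embeddings are typically compared by cosine similarity. A minimal, dependency-free sketch, using stand-in vectors rather than real model output:

```python
import math

def cosine_similarity(a, b):
    # Dot product divided by the product of the vector norms
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

# Stand-in embeddings; in practice use the vectors returned by embedder.generate()
e1 = [0.1, 0.3, 0.5]
e2 = [0.2, 0.1, 0.4]
print(f"similarity: {cosine_similarity(e1, e2):.4f}")
```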
Reranker
from nexaai.rerank import Reranker, RerankConfig
# Initialize reranker
model_path = "~/.cache/nexa.ai/nexa_sdk/models/NexaAI/jina-v2-rerank-mlx/jina-reranker-v2-base-multilingual-f16.safetensors"
reranker = Reranker.from_(name_or_path=model_path, plugin_id="cpu_gpu")
# Rerank documents
query = "What is machine learning?"
documents = ["Machine learning is a subset of AI", "Python is a programming language"]
config = RerankConfig(batch_size=2)
scores = reranker.rerank(query=query, documents=documents, config=config)
for doc, score in zip(documents, scores):
    print(f"[{score:.4f}] {doc}")
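Rerank scores can then be used to order the documents best-first. A small sketch with stand-in scores in place of real reranker output:

```python
# Stand-in data; in practice use the scores returned by reranker.rerank()
documents = ["Machine learning is a subset of AI", "Python is a programming language"]
scores = [0.92, 0.08]

# Pair each document with its score and sort highest score first
ranked = sorted(zip(scores, documents), reverse=True)
for score, doc in ranked:
    print(f"[{score:.4f}] {doc}")
```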
Next Steps