The /v1/models
endpoint provides a list of all available AI models for chat completions and other generative tasks. Use this endpoint to discover model IDs, capabilities, pricing, and other metadata before making requests to the Lunos API.
GET /v1/models
No authentication is required to access this endpoint.
The response is a JSON object containing a data array of model objects. Each model object contains details about a model, including its ID, name, provider, capabilities, pricing, status, description, and summary.
1{
2 "data": [
3 {
4 "id": "openai/gpt-4o",
5 "name": "GPT-4o",
6 "parameters": {
7 "context": 128000,
8 "max_output_tokens": 4096,
9 "size": "4B"
10 },
11 "provider": "openai",
12 "pricePerMillionTokens": {
13 "input": 5.0,
14 "output": 15.0,
15 "discountPercentage": 0
16 },
17 "capabilities": ["text-generation", "chat", "vision"],
18 "status": "available",
19 "description": "GPT-4o is OpenAI's flagship model, ideal for complex tasks that require advanced reasoning and creative output.",
20 "summary": "Advanced reasoning model for complex tasks and creative output.",
21 "freeConfig": {
22 "maxTokens": 1000,
23 "isPremium": false
24 }
25 }
26 ]
27}
Field | Type | Description |
---|---|---|
id | string | Unique model identifier (use this in API requests). |
name | string | Human-readable model name. |
parameters.context | number | Maximum context window (tokens). |
parameters.max_output_tokens | number | Maximum output tokens per request. |
parameters.size | string | Model size (e.g., "4B"). |
provider | string | Model provider (e.g., "openai"). |
pricePerMillionTokens.input | number | Cost per 1M input tokens (USD). |
pricePerMillionTokens.output | number | Cost per 1M output tokens (USD). |
pricePerMillionTokens.discountPercentage | number | Discount percentage applied to pricing. |
capabilities | string[] | Array of supported features (e.g., text-generation , chat , vision ). |
status | string | Model availability status (e.g., available ). |
description | string | Detailed description of the model and its use cases. |
summary | string | Short summary of the model (less than 50 words). |
freeConfig | object (optional) | Free tier configuration if available. |
freeConfig.maxTokens | number | Maximum tokens available in free tier. |
freeConfig.isPremium | boolean | Whether the model requires premium access. |
curl -X GET https://api.lunos.tech/v1/models
1{
2 "data": [
3 {
4 "id": "openai/gpt-4o",
5 "name": "GPT-4o",
6 "parameters": {
7 "context": 128000,
8 "max_output_tokens": 4096,
9 "size": "4B"
10 },
11 "provider": "openai",
12 "pricePerMillionTokens": {
13 "input": 5.0,
14 "output": 15.0,
15 "discountPercentage": 0
16 },
17 "capabilities": ["text-generation", "chat", "vision"],
18 "status": "available",
19 "description": "GPT-4o is OpenAI's flagship model, ideal for complex tasks that require advanced reasoning and creative output.",
20 "summary": "Advanced reasoning model for complex tasks and creative output.",
21 "freeConfig": {
22 "maxTokens": 1000,
23 "isPremium": false
24 }
25 },
26 {
27 "id": "openai/text-embedding-3-small",
28 "name": "Text Embedding 3 Small",
29 "parameters": {
30 "context": 8192,
31 "max_output_tokens": 2000,
32 "size": "1B"
33 },
34 "provider": "openai",
35 "pricePerMillionTokens": {
36 "input": 0.1,
37 "output": 0.3,
38 "discountPercentage": 10
39 },
40 "capabilities": ["embedding"],
41 "status": "available",
42 "description": "Small and efficient embedding model for semantic search and similarity tasks.",
43 "summary": "Efficient embedding model for semantic search and similarity.",
44 "freeConfig": {
45 "maxTokens": 5000,
46 "isPremium": false
47 }
48 }
49 ]
50}
id
field as the model
parameter in your API requests.capabilities
array to ensure the model supports your use case (e.g., chat
, vision
, embedding
).status
field to confirm the model is available
before using it in production.pricePerMillionTokens
to optimize for cost based on your expected usage.parameters
for model-specific limits like context window and max output tokens.No headings found on this page.