Models API Endpoint

The /v1/models endpoint returns a list of all AI models available for chat completions and other generative tasks. Use it to discover model IDs, capabilities, pricing, and other metadata before making requests to the Lunos API.

Endpoint

GET /v1/models

Authentication

No authentication is required to access this endpoint.

Response

The response is a JSON object with a data array of model objects. Each object describes one model: its ID, name, provider, parameters, pricing, capabilities, status, description, summary, and an optional free-tier configuration.

{
  "data": [
    {
      "id": "openai/gpt-4o",
      "name": "GPT-4o",
      "parameters": {
        "context": 128000,
        "max_output_tokens": 4096,
        "size": "4B"
      },
      "provider": "openai",
      "pricePerMillionTokens": {
        "input": 5.0,
        "output": 15.0,
        "discountPercentage": 0
      },
      "capabilities": ["text-generation", "chat", "vision"],
      "status": "available",
      "description": "GPT-4o is OpenAI's flagship model, ideal for complex tasks that require advanced reasoning and creative output.",
      "summary": "Advanced reasoning model for complex tasks and creative output.",
      "freeConfig": {
        "maxTokens": 1000,
        "isPremium": false
      }
    }
  ]
}

Model Object Fields

Field	Type	Description
`id`	string	Unique model identifier (use this in API requests).
`name`	string	Human-readable model name.
`parameters.context`	number	Maximum context window (tokens).
`parameters.max_output_tokens`	number	Maximum output tokens per request.
`parameters.size`	string	Model size (e.g., "4B").
`provider`	string	Model provider (e.g., "openai").
`pricePerMillionTokens.input`	number	Cost per 1M input tokens (USD).
`pricePerMillionTokens.output`	number	Cost per 1M output tokens (USD).
`pricePerMillionTokens.discountPercentage`	number	Discount percentage applied to pricing.
`capabilities`	string[]	Array of supported features (e.g., `text-generation`, `chat`, `vision`).
`status`	string	Model availability status (e.g., `available`).
`description`	string	Detailed description of the model and its use cases.
`summary`	string	Short summary of the model (fewer than 50 words).
`freeConfig`	object (optional)	Free-tier configuration, if available.
`freeConfig.maxTokens`	number	Maximum tokens available in the free tier.
`freeConfig.isPremium`	boolean	Whether the model requires premium access.
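
The fields above map naturally onto a typed structure on the client side. The following is a minimal sketch, not an official SDK type: the `Model` and `FreeConfig` classes and the `model_from_dict` helper are hypothetical names chosen for illustration. It treats `freeConfig` as optional, as the table indicates, and assumes every other field is present.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class FreeConfig:
    max_tokens: int
    is_premium: bool


@dataclass(frozen=True)
class Model:
    id: str
    name: str
    provider: str
    context: int
    max_output_tokens: int
    size: str
    input_price: float          # USD per 1M input tokens
    output_price: float         # USD per 1M output tokens
    discount_percentage: float
    capabilities: tuple
    status: str
    description: str
    summary: str
    free_config: Optional[FreeConfig] = None


def model_from_dict(raw: dict) -> Model:
    """Convert one entry of the `data` array into a Model (hypothetical helper)."""
    params = raw["parameters"]
    price = raw["pricePerMillionTokens"]
    free = raw.get("freeConfig")  # documented as optional
    free_config = None
    if free is not None:
        free_config = FreeConfig(free["maxTokens"], free["isPremium"])
    return Model(
        id=raw["id"],
        name=raw["name"],
        provider=raw["provider"],
        context=params["context"],
        max_output_tokens=params["max_output_tokens"],
        size=params["size"],
        input_price=price["input"],
        output_price=price["output"],
        discount_percentage=price["discountPercentage"],
        capabilities=tuple(raw["capabilities"]),
        status=raw["status"],
        description=raw["description"],
        summary=raw["summary"],
        free_config=free_config,
    )
```

Flattening `parameters` and `pricePerMillionTokens` into top-level attributes is a design choice for convenience; nested dataclasses would work equally well.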

Example Request

curl -X GET https://api.lunos.tech/v1/models
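
The same request can be made from Python using only the standard library. This is a rough sketch under a few assumptions: the URL is taken from the curl example, and `build_models_request`, `parse_models`, and `fetch_models` are hypothetical helper names, not part of any Lunos client library. No `Authorization` header is sent, since the endpoint is documented as not requiring authentication.

```python
import json
import urllib.request

# Base URL taken from the curl example above.
MODELS_URL = "https://api.lunos.tech/v1/models"


def build_models_request(url: str = MODELS_URL) -> urllib.request.Request:
    """Build a GET request for the models list, without any auth header."""
    return urllib.request.Request(
        url,
        method="GET",
        headers={"Accept": "application/json"},
    )


def parse_models(body: bytes) -> list:
    """Extract the list of model objects from a raw JSON response body."""
    payload = json.loads(body)
    return payload["data"]


def fetch_models(url: str = MODELS_URL, timeout: float = 10.0) -> list:
    """Fetch and return the model objects (performs a network call)."""
    with urllib.request.urlopen(build_models_request(url), timeout=timeout) as resp:
        return parse_models(resp.read())
```

Calling `fetch_models()` would return the contents of the `data` array; splitting request construction and parsing into separate functions keeps both testable without network access.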

Example Response

{
  "data": [
    {
      "id": "openai/gpt-4o",
      "name": "GPT-4o",
      "parameters": {
        "context": 128000,
        "max_output_tokens": 4096,
        "size": "4B"
      },
      "provider": "openai",
      "pricePerMillionTokens": {
        "input": 5.0,
        "output": 15.0,
        "discountPercentage": 0
      },
      "capabilities": ["text-generation", "chat", "vision"],
      "status": "available",
      "description": "GPT-4o is OpenAI's flagship model, ideal for complex tasks that require advanced reasoning and creative output.",
      "summary": "Advanced reasoning model for complex tasks and creative output.",
      "freeConfig": {
        "maxTokens": 1000,
        "isPremium": false
      }
    },
    {
      "id": "openai/text-embedding-3-small",
      "name": "Text Embedding 3 Small",
      "parameters": {
        "context": 8192,
        "max_output_tokens": 2000,
        "size": "1B"
      },
      "provider": "openai",
      "pricePerMillionTokens": {
        "input": 0.1,
        "output": 0.3,
        "discountPercentage": 10
      },
      "capabilities": ["embedding"],
      "status": "available",
      "description": "Small and efficient embedding model for semantic search and similarity tasks.",
      "summary": "Efficient embedding model for semantic search and similarity.",
      "freeConfig": {
        "maxTokens": 5000,
        "isPremium": false
      }
    }
  ]
}
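
To show how the pricing fields in this response might combine, here is a small cost estimate for a single request. It rests on an assumption the page does not confirm: that the listed `input` and `output` prices are pre-discount and that `discountPercentage` is applied on top of them. If the listed prices already include the discount, the last step should be dropped. `estimate_cost_usd` is a hypothetical helper name.

```python
def estimate_cost_usd(price: dict, input_tokens: int, output_tokens: int) -> float:
    """Estimate request cost in USD from a pricePerMillionTokens object.

    Assumption: listed prices are pre-discount, and discountPercentage
    reduces the total by that percentage.
    """
    base = (
        input_tokens * price["input"] + output_tokens * price["output"]
    ) / 1_000_000
    discount = price.get("discountPercentage", 0) / 100
    return base * (1 - discount)
```

Under that assumption, one million input tokens plus one million output tokens on the embedding model above would come to roughly 0.36 USD (0.1 + 0.3, less 10%).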

Usage Tips

Use the `id` field as the `model` parameter in your API requests.
Check the `capabilities` array to ensure the model supports your use case (e.g., `chat`, `vision`, `embedding`).
Review the `status` field to confirm the model is available before using it in production.
Compare `pricePerMillionTokens` to optimize for cost based on your expected usage.
Reference `parameters` for model-specific limits like context window and max output tokens.
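
The tips above can be sketched as a few small helpers operating on the raw `data` array. The function names are hypothetical, and `fits_limits` assumes the context window covers prompt and output tokens combined, which this page does not state explicitly.

```python
def usable_models(models: list, capability: str) -> list:
    """Keep models that list the capability and report status 'available'."""
    return [
        m for m in models
        if capability in m.get("capabilities", [])
        and m.get("status") == "available"
    ]


def cheapest_by_input_price(models: list) -> dict:
    """Return the model with the lowest listed input price per 1M tokens."""
    return min(models, key=lambda m: m["pricePerMillionTokens"]["input"])


def fits_limits(model: dict, prompt_tokens: int, max_output: int) -> bool:
    """Check a planned request against the model's documented limits.

    Assumption: prompt and output tokens share the context window.
    """
    params = model["parameters"]
    return (
        max_output <= params["max_output_tokens"]
        and prompt_tokens + max_output <= params["context"]
    )
```

A typical flow would be to filter with `usable_models(models, "chat")`, pick one with `cheapest_by_input_price`, and pass its `id` as the `model` parameter once `fits_limits` confirms the request size.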
