Lunos offers free daily token requests for select AI models. This feature allows you to use certain models without incurring costs, up to a specified token limit each day. This guide explains how free daily requests work and how to make the most of them.
Note: Free daily requests are available for select models only. Look for the 🎁 gift icon next to model names in the models table to identify which ones offer free daily tokens.
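Free daily requests allow you to use certain AI models up to a specified token limit each day without being charged. The allowance resets at midnight UTC, unused tokens do not roll over, and requests beyond the limit continue to work but are billed at the standard rate.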
Important: Some models with free daily requests require a premium account. Currently, gemini 2 flash and gpt 4.1 mini are available only to premium users, while gemma 3 12B, llama 4 scout, and gemini 2 flash lite are available to all users.
The following models currently offer free daily token allowances. The specific token limits may change over time as we adjust our offerings.
Model | Provider | Free Daily Tokens | Premium Required |
---|---|---|---|
gemma 3 12B | Google | 15K | No |
llama 4 scout | Meta | 20K | No |
gemini 2 flash lite | Google | 25K | No |
gemini 2 flash | Google | 30K | Yes |
gpt 4.1 mini | OpenAI | 25K | Yes |
For the most up-to-date list of models with free daily tokens, visit the Models page and look for models with the 🎁 gift icon.
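For client-side checks, the table above can be mirrored as a small lookup. This is only a minimal sketch: `FREE_DAILY_ALLOWANCES` and `freeModelsFor` are hypothetical names, the values are a snapshot of the table and may go stale, and the keys are the display names from the table rather than API model identifiers.

```javascript
// Snapshot of the free daily allowances listed in the table above.
// Keys are display names, not API model identifiers; limits may change.
const FREE_DAILY_ALLOWANCES = {
  "gemma 3 12B": { freeDailyTokens: 15_000, premiumRequired: false },
  "llama 4 scout": { freeDailyTokens: 20_000, premiumRequired: false },
  "gemini 2 flash lite": { freeDailyTokens: 25_000, premiumRequired: false },
  "gemini 2 flash": { freeDailyTokens: 30_000, premiumRequired: true },
  "gpt 4.1 mini": { freeDailyTokens: 25_000, premiumRequired: true },
};

// Hypothetical helper: list the models whose free allowance an account can use.
function freeModelsFor(isPremium) {
  return Object.entries(FREE_DAILY_ALLOWANCES)
    .filter(([, info]) => isPremium || !info.premiumRequired)
    .map(([name]) => name);
}

console.log(freeModelsFor(false));
```

Keeping the premium flag next to the limit means a single lookup answers both "is this model free for me?" and "how many tokens do I get?".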
Using models with free daily requests is no different from using any other model through the Lunos API. Simply specify the model in your API request as you normally would:
```javascript
import OpenAI from 'openai';

const client = new OpenAI({
  apiKey: 'your_api_key_here',
  baseURL: 'https://api.lunos.tech/v1',
});

async function generateText() {
  const completion = await client.chat.completions.create({
    model: "gemma-3-12b", // Model with free daily tokens
    messages: [
      { role: "system", content: "You are a helpful assistant." },
      { role: "user", content: "Tell me about free daily requests on Lunos." }
    ],
  });

  console.log(completion.choices[0].message.content);
}

generateText();
```
You can monitor your free daily token usage in the Usage Statistics section of your dashboard.
To make the most of your free daily tokens, consider capping the length of each response with `max_tokens`, as in this example:
```javascript
// Example of setting maximum tokens to control usage.
// Reuses the `client` configured in the earlier example.
const completion = await client.chat.completions.create({
  model: "meta-llama/llama-4-scout",
  messages: [
    { role: "system", content: "You are a helpful assistant." },
    { role: "user", content: "Summarize the concept of free daily tokens." }
  ],
  max_tokens: 100, // Limit the response to 100 tokens
});
```
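Capping `max_tokens` also makes the daily budget easier to reason about. The following is a rough sketch, not an official formula: the `maxRequestsPerDay` helper and the 50-token prompt size are illustrative assumptions, and it assumes that both prompt and completion tokens count toward the allowance, which this guide does not state explicitly.

```javascript
// Worst-case number of requests that fit in a free daily allowance, assuming
// every response uses its full max_tokens and prompt tokens count as well.
function maxRequestsPerDay(dailyTokens, promptTokens, maxTokens) {
  const perRequest = promptTokens + maxTokens;
  if (perRequest <= 0) throw new RangeError("token counts must be positive");
  return Math.floor(dailyTokens / perRequest);
}

// llama 4 scout: 20K free tokens, ~50-token prompt (assumed), max_tokens: 100
console.log(maxRequestsPerDay(20_000, 50, 100)); // 133
```

Actual usage is usually lower than this worst case, since many responses finish before reaching `max_tokens`.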
No, unused free tokens do not roll over. Your free token allowance resets at midnight UTC each day.
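If you schedule work around the reset, the time remaining until midnight UTC can be computed locally. This is a minimal sketch; `msUntilUtcReset` is a hypothetical helper, and it relies on your machine's clock rather than on anything reported by the Lunos API.

```javascript
// Milliseconds from `now` until the next midnight UTC, when the allowance resets.
function msUntilUtcReset(now = new Date()) {
  const nextMidnight = Date.UTC(
    now.getUTCFullYear(),
    now.getUTCMonth(),
    now.getUTCDate() + 1 // Date.UTC normalizes overflow into the next month/year
  );
  return nextMidnight - now.getTime();
}

const hours = msUntilUtcReset() / 3_600_000;
console.log(`Free tokens reset in about ${hours.toFixed(1)} hours`);
```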
Yes, you can use free tokens for any purpose, including production applications. However, for production use cases with consistent traffic, we recommend having a billing method set up to ensure uninterrupted service when you exceed the free limits.
You can monitor your usage in the dashboard. Additionally, once you exceed your free token limit, requests will continue to work but will be billed at the standard rate.
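Because requests past the free limit are billed rather than rejected, it can help to keep a rough running total on the client as well. The sketch below tracks usage per UTC day; the `FreeTokenTracker` class is hypothetical, and it assumes that responses carry an OpenAI-style `usage.total_tokens` field. The dashboard remains the authoritative source.

```javascript
// Hypothetical in-memory tracker for one model's free daily allowance.
class FreeTokenTracker {
  constructor(dailyLimit) {
    this.dailyLimit = dailyLimit;
    this.day = null; // UTC date (YYYY-MM-DD) the running total belongs to
    this.used = 0;
  }

  // Record tokens from a response; the total resets when the UTC date changes.
  record(totalTokens, now = new Date()) {
    const today = now.toISOString().slice(0, 10);
    if (today !== this.day) {
      this.day = today;
      this.used = 0;
    }
    this.used += totalTokens;
    return this.remaining();
  }

  // Free tokens left today; 0 means further requests are likely billed.
  remaining() {
    return Math.max(0, this.dailyLimit - this.used);
  }
}

const tracker = new FreeTokenTracker(15_000); // gemma 3 12B allowance from the table
// After each call: tracker.record(completion.usage.total_tokens);
console.log(tracker.record(1_200)); // 13800
```

The tracker lives in memory only, so a long-running service would need to persist the total or share it across processes.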
Free tokens are subject to the same usage policies and terms of service as paid usage. There are no additional restrictions specific to free tokens.
Get Started: Ready to try out free daily requests? Check out our Quick Start Guide to begin using the Lunos API with models that offer free daily tokens.