Better LLM documentation for Assistant

azdanov

It would be nice to have better documentation regarding LLMs in Kagi. Benchmark is very nice, but unless someone's experienced with the topic it doesn't offer much.

Like, how does tokens in messages work withing current soft limit of 250k: e.g

To estimate the number of messages, let's break this down:

    Average message length varies but typically ranges from 100-300 tokens
    Let's use 200 tokens as a reasonable average per message
    With a 250,000 token weekly limit
    Basic calculation: 250,000 ÷ 200 = 1,250 messages


So you could send approximately 1,250 messages per week, or about 178 messages per day.

Keep in mind that:

    Longer, more complex messages use more tokens
    System prompts and AI responses also count toward the limit
    The actual number could be lower if your messages tend to be longer

How does the context work. What to do if a model refuses to answer, e.g. Claude is notorious for refusals, and Mistral Large is very "open-minded".

It would be also nice to have the docs visible and accessible from the Assistant page, so beginners to LLMs can learn how to use them better, resulting in less frustration for them and saving costs for Kagi, since users will get results they wish faster.

slamor

azdanov Dude, even a simple repharase process costs 2000 tokens

Luis

azdanov thank you for the suggestions. I will break these down and share some initial thoughts,

1/ disambiguiate usage limits

To estimate the number of messages, let's break this down:

Average message length varies but typically ranges from 100-300 tokens
Let's use 200 tokens as a reasonable average per message
With a 250,000 token weekly limit
Basic calculation: 250,000 ÷ 200 = 1,250 messages

So you could send approximately 1,250 messages per week, or about 178 messages per day.

Keep in mind that:

Longer, more complex messages use more tokens
System prompts and AI responses also count toward the limit
The actual number could be lower if your messages tend to be longer

This is a good idea to eliminate ambiguity regarding usage limits. We'll try to add something to our Assistant help page along those lines.

2/ How do LLMs work

How does the context work. What to do if a model refuses to answer, e.g. Claude is notorious for refusals, and Mistral Large is very "open-minded".

These type of questions are best addressed through individual research, unless we opt to create and update a knowledge wiki focused on a rapidly evolving technology...

3/ Making the Assistant more accessible

It would be also nice to have the docs visible and accessible from the Assistant page, so beginners to LLMs can learn how to use them better, resulting in less frustration for them and saving costs for Kagi, since users will get results they wish faster.

We're considering using Custom Assistants to address that. For instance, providing a multiple Custom Assistants by default optimised for specific tasks or workflows, like !code