Kagi Assistant only summarizes PDFs instead of processing full contents (selectable text and handwritten notes)

Pum

When providing a PDF file to Kagi Assistant, it appears to only generate a high-level summary of the file rather than parsing the actual content. When uploading a PDF containing selectable text and handwritten math solutions, the assistant fails to read the specific details. Instead of reviewing the solutions as requested, it outputs a summary of the document's general topic and asks the user to manually provide the detailed steps.

Steps to replicate:

Open Kagi Assistant.
Upload a PDF that includes both selectable text and handwritten notes (e.g., a solved worksheet).
Prompt the assistant to evaluate or correct the solutions in the document.
The assistant will fail to read the specific content, provide a basic summary of the document, and prompt you to supply the actual text.

The assistant should pass the full file contents to the underlying model so it can read and process both selectable text and handwritten notes. When uploading the exact same PDF directly to Google AI Studio, ChatGPT, or the native Gemini app, the models successfully read the handwritten steps and provide targeted feedback on the solutions. Kagi Assistant should replicate this behavior and allow the selected LLM to fully analyze the document's contents, rather than defaulting to a generic summary mechanism.

What happens and comparisons against chatgpt/ai studio)
kagi
chatgpt

Anonymous26

Did you really think they would pass files into context window?

If they do the cost would be 50x higher

but at least I would except them being transparent about that, not misinforming;

Context retention: Uploaded file content remains in the conversation context for subsequent messages. documentation

Luis

Anonymous26 there is a step where it decided to either pass the full file into the context window or relevant chunks; so yes, the full file is often passed to the context window if not too large.

We've just deployed some changes that should have improved this situation. Please let us know if you're still experience any issues and we'll reopen and investigate further. Thank you!

Anonymous26

btw chatgpt.com also don't pass files into context (but aistudio.google.com does)

maybe kagi assistant settings is the problem (looks like it tries to always be very short and lazy)

igakagi

Gemini has native PDF support and apparently it's more affordable than one might expect (258 tokens per page).
Claude and GPT also have native PDF support, though a bit more expensive.

This was already requested _a year ago, without much of a response 🙁
https://kagifeedback.org/d/6801-gemini-use-document-processing-for-much-better-pdf-handling

someoneiknow

This has been a major annoyance for me too. I have basically given up on uploading PDFs to kagi Assistant. Every now and then, I try it to see if there's any improvement. Example from today's test: https://kagi.com/assistant/a3827fff-0bf4-4b29-803a-840f5b7c5168

The two attached PDFs are the original datasheets from ST's website. Sadly, it failed. For instance, the ILPS22QS clearly mentions the shock resistance on the very first page, and the LPS22HB lists the current consumption on the first page.

Bonarc

+1 on this. 100K characters is a low limit. Also if a second model is just answering the question, what's even the point of the main model.

Also Kagi should indicate when it's not reading the full content as otherwise you get really mediocre results seemingly at random with no indication why.