I uploaded a 272-page PDF and started asking questions about it. Unlike with shorter documents, even the initial summary was wrong, although it did correct itself after I pointed this out.
Answers to my questions about the document stayed pretty vague, though, or contained outright mistakes (for example, that there were no programming examples in the book). When I asked it "What further books" to read on the subject, it told me it could not answer that because the book does not mention sources, even though the book contains references.
The examples as well as the references come pretty late in the book, so my theory is that the book silently crossed the token limit of whatever LLM was in use, and the model never even saw the pages that would have helped it answer my questions.
So, the suggestion:
If whatever you upload crosses the token limit, show an estimate of what percentage (or how many pages) of the document was actually used, and maybe suggest uploading smaller parts instead.
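To make the suggestion concrete, here is a rough sketch of how such an estimate could be computed. I don't know how the actual product handles long documents, so everything here is an assumption: the names `estimate_tokens`, `coverage` and `coverage_warning` are made up for illustration, the 4-characters-per-token figure is only a common rule of thumb, and I assume the document is cut off at the first page that no longer fits.

```python
from dataclasses import dataclass
from typing import List, Optional

# Assumption: roughly 4 characters per token. A real system would
# use the tokenizer of whatever model is actually in use.
CHARS_PER_TOKEN = 4


def estimate_tokens(text: str) -> int:
    """Very rough token estimate for a piece of text."""
    return len(text) // CHARS_PER_TOKEN


@dataclass
class Coverage:
    pages_used: int
    pages_total: int
    tokens_used: int
    tokens_total: int

    @property
    def percent_used(self) -> float:
        """Share of the document's estimated tokens that fit, in percent."""
        if self.tokens_total == 0:
            return 100.0
        return 100.0 * self.tokens_used / self.tokens_total


def coverage(pages: List[str], token_limit: int) -> Coverage:
    """Count how many leading pages fit into token_limit.

    Assumption: pages are consumed in order and everything after the
    first page that does not fit is dropped.
    """
    counts = [estimate_tokens(page) for page in pages]
    tokens_used = 0
    pages_used = 0
    for count in counts:
        if tokens_used + count > token_limit:
            break
        tokens_used += count
        pages_used += 1
    return Coverage(pages_used, len(pages), tokens_used, sum(counts))


def coverage_warning(cov: Coverage) -> Optional[str]:
    """Return a user-facing warning, or None if the whole document fit."""
    if cov.pages_used == cov.pages_total:
        return None
    return (
        f"Only about {cov.percent_used:.0f}% of this document "
        f"({cov.pages_used} of {cov.pages_total} pages) could be used. "
        "Consider uploading smaller parts."
    )
```

Something like `coverage_warning(coverage(pages, token_limit))`, with `pages` being the extracted text of each page, would then give a message to show next to the upload, or nothing when the whole document fit.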
thread: K0v5HQOnOZCOqNs8udvj5IvS0TGV3eCP