
oxlvlnle +1 for being able to send a prompt at the same time as the upload. I'm using it to create alt texts, so I'm not really interested in the default output.

    Vlad unstickied the discussion.

      oxlvlnle It is, but the reason why I pay for Unlimited is so that I'm not a financial hit to Kagi, I want them to benefit. So if I have an especially heavy research weekend and some extra cash, I'd love a way to throw it Kagi's way. 😛

        11 days later

        Text to speech: Ultimate could have the ability to play audio of the printed text. There is an OpenAI API that does a good job, and since it costs money, it would make sense to include it with Ultimate only: https://platform.openai.com/docs/guides/text-to-speech

        Users could have the option to choose the voice and to download the recording as MP3/AAC.

        There are 2 models, regular TTS (costs $15/million characters) and TTS-HD ($30/million characters). I think the regular is pretty sufficient.
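
To put those two prices in perspective, here is a rough sketch of the per-answer cost. It assumes the per-million-character prices quoted above are still accurate, and the names `PRICE_PER_MILLION_CHARS` and `tts_cost_usd` are made up for illustration; the dictionary keys are just labels for the two tiers, not actual API model identifiers.

```python
# Prices in USD per million characters, taken from the post above.
# Assumption: these may be out of date; check the current pricing page.
PRICE_PER_MILLION_CHARS = {
    "tts": 15.00,     # regular model
    "tts-hd": 30.00,  # HD model
}


def tts_cost_usd(text: str, tier: str = "tts") -> float:
    """Estimate the cost of reading `text` aloud at the given price tier."""
    return len(text) * PRICE_PER_MILLION_CHARS[tier] / 1_000_000


if __name__ == "__main__":
    answer = "x" * 2000  # stand-in for a 2,000-character assistant answer
    for tier in PRICE_PER_MILLION_CHARS:
        print(f"{tier}: ${tts_cost_usd(answer, tier):.4f}")
```

Under those assumptions, a 2,000-character answer costs a few cents at most, so the price gap between the two tiers only matters at high volume.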

          silvenga I like this idea. Or at least some way to explicitly indicate that Ultimate users get some preference in terms of helping to set Kagi priorities/roadmap.

            16 days later

            frin Being able to “talk” to Kagi, especially the fast/expert mode with follow-up questions, would be a cool Ultimate plan feature.

            3 months later
            2 months later
            Vlad stickied the discussion.

              Maybe a niche use case, but I've always wanted to use search/assistant from the command line.
              I'm a (n)vim user and dislike having to switch to the browser to search or use the assistant.

              I realize that the API limitations are in place to prevent abuse and overuse, which is probably the biggest challenge in exposing this to a CLI tool. I would be perfectly OK with heavy throttling restrictions on this, since the operations I want to do would be 100% the same as what I do on the web. I just want to stay in nvim.

                Elifino I currently use Shell GPT: https://github.com/TheR1D/shell_gpt

                It connects to OpenAI, but once an assistant API is available, you could use that instead of OpenAI.

                I use it a lot, mainly to generate commands for me. For example, I can run
                sgpt -s "extract all even pages from document.pdf and put them in a new folder 'newfolder'"
                (-s means shell command)
                and it'll give me the command and ask whether I want to run it.

                  6 days later

                  Thibaultmol Thanks for sharing. I'm using gen.nvim connected to a local Ollama (I don't trust OpenAI from a privacy perspective), but web search functionality is what I lack. Having this integrate with Kagi would be superb 🙂

                    5 days later

                    I feel like priority support would be a great addition to the Ultimate plan.

                      fxgn While I second this, the Kagi team is quick to deal with any technical problems. Just message them on Discord and you will usually get an answer in a short time, whether you are an Ultimate user or not.

                        oxlvlnle I third it, BUT keep in mind that not everyone uses Discord. A lot of privacy-minded folks aren't on Discord, for example. So ideally Ultimate subscribers would get their emails handled with high priority.

                        a month later

                        Hi. I would like to suggest adding to the AI assistant the ability to send files to the user, for example, Word, Excel, or PDF documents. This is something I really miss in my daily work. 🙂

                          Give Ultimate subscribers a way to become anonymous in the Kagi ecosystem. Vlad mentioned in another thread that it is possible to do Mullvad-style logins. If that comes with a cost, lock it behind the Ultimate plan and you've got a winner! Those who want anonymity get anonymity, and if there are many of them among us, Kagi gets an uptick in Ultimate users.

                          a month later

                          I'd really love a native mobile app for the assistant.

                          Even using it as a PWA on my iPhone is very clunky, with some of the assistant's UI elements obscured behind built-in iOS UI elements:

                          ...and being forced to approve microphone access literally EVERY time I want to use it:

                          Bonus points if it could integrate directly with Siri, but I know that would take partnership with Apple, who thus far have only offered ChatGPT, though they've said that more partnerships will come...