I expected the Assistant to do very poorly and was surprised that for several searches it agreed with my assessment perfectly. I have seen several research summaries showing LLMs cannot reliably identify LLM-generated content, but the Assistant seems to have nearly perfect accuracy on the slop littering my search results.
The procedure I'm suggesting does not need to scale beyond individual searches, and it would only run when requested by the user.
There is not much of an adversarial relationship or arms race for this implementation, because Kagi is presently too small a player for these slop site creators to care. The arms race already exists with manual filtering and whatever other methods are employed in SlopStop anyway.
Assuming the model purveyors respect their contracts with Kagi, they are legally obligated not to "read" the slop websites evaluated by this Assistant detection (that is, use them to train subsequent models) or otherwise be informed by the slop.
Of course, Kagi SlopStop should work differently from what I'm suggesting. My suggestion is a shortcut (sketched after this list) that does two things well:
- Aggressively filters out likely slop NOW, without relying on reporting
- Reports potential slop for manual review by the SlopStop team, so it can be used to improve SlopStop
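For concreteness, here is a rough sketch of the per-search flow I have in mind, run only when the user asks for it. This is hypothetical pseudocode on my part: `ask_assistant_is_slop` and `report_to_slopstop` are placeholders for whatever Assistant call and reporting channel Kagi would actually use, and nothing here refers to a real Kagi API.

```python
from dataclasses import dataclass


@dataclass
class SearchResult:
    url: str
    title: str
    snippet: str


def ask_assistant_is_slop(result: SearchResult) -> bool:
    """Placeholder: ask the Assistant whether this result looks like LLM-generated slop."""
    raise NotImplementedError


def report_to_slopstop(result: SearchResult) -> None:
    """Placeholder: queue the result for manual review by the SlopStop team."""
    raise NotImplementedError


def filter_and_report(results: list[SearchResult]) -> list[SearchResult]:
    """Run on demand for a single search: drop likely slop now, report it for later review."""
    kept = []
    for result in results:
        if ask_assistant_is_slop(result):
            # Flagged results are hidden immediately and also sent for manual review,
            # so they can feed back into improving SlopStop.
            report_to_slopstop(result)
        else:
            kept.append(result)
    return kept
```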
The biggest (and only serious) drawback is that it adds a long wait after conducting a search. However, I am presently wasting a lot of time wading through slop, so the method I'm proposing would still save me time.
It might be nice to review each item myself before it is reported to SlopStop, but so far my testing has shown the detection to be accurate.