OK, this is a very interesting discussion 😋.
Here is my opinion.
1.
Even Google returns AI slop often (specifically those AI-generated videos).
Google is often fooled by auto-generated PDF files claiming to be books but containing nothing but metadata and blurbs. This was commonplace even before AI (these are scrape-and-generate scripts).
And given that Kagi's indexes are based on ones like Google's, the presence of AI slop is understandable.
However, Kagi is in a favorable position of being able to filter out the slop as many users here expect.
As Vlad mentioned, a ML model can be a very effective solution here.
2.
Being able to report AI slop makes me feel empowered.
This is the kind of feature I love and expect from a community-serving service like Kagi.
This is not to say it is perfect, but to emphasize that the presence of such a feature, even if imperfect, is greatly valued.
3.
Now to the most critical part of the discussion (and this is really exciting :D):
If you let users tag contents themselves, you run into many problems:
― malicious users will report useful and valuable results
― biased users ('activists') will report content they do not like, even if it is useful or acceptable to others
― unethical e-marketers will try to push down competing results and push up their own (I've seen this on a local real estate community ads site)
― confused or presumptuous users may mistakenly report good content
― some users may accidentally report a site
For the last case, a simple UI feature can fix that: the ability to toggle the report back off (this is faster than visiting the Kagi Settings page and hunting for that entry).
As for relying on user contributions: human review, as currently done, rather than automation, is the best solution. If the volume of reported content grows large, relying on automated tools (e.g., ML models trained on the human reviews) would probably be effective.
One user suggested reputation points for users based on their contributions: the more accurate their reports, the more weight their reports get. This could be used as a weighting metric when evaluating a reported item, but only if it is too difficult to tell whether it is slop. Another metric would be whether it has been reported by other users. This would only apply to very sly AI-generated content; at present, AI content is easy enough to identify that this level of heuristics is not required.
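To make the accuracy-based weighting concrete, here is a minimal sketch. The `reporter_weight` helper, the 0.5 prior, and the smoothing count are my own assumptions for illustration, not anything Kagi actually does; the smoothing just keeps brand-new reporters from starting at an extreme weight of 0 or 1:

```python
def reporter_weight(accurate: int, total: int,
                    prior: float = 0.5, prior_n: int = 5) -> float:
    """Smoothed accuracy: new reporters start near the prior (0.5),
    and the weight converges to their true accuracy as reports accumulate."""
    return (accurate + prior * prior_n) / (total + prior_n)

print(reporter_weight(0, 0))    # brand-new reporter → 0.5
print(reporter_weight(18, 20))  # consistently accurate reporter → 0.82
```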
But this kind of heuristic analysis may become necessary if things get very gloomy. For the fun of it, here is a suggested algorithm: progress through checks until a certain level of decision confidence is reached: the content itself → other content by the same channel/website (i.e., stepping up from content to content author) → reporter reputation → reports by others and their mean reputation? → human review.
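That escalation could be sketched roughly like this. Everything here is hypothetical: the `Report` fields, the stage order, and the 0.9 threshold are illustrative assumptions; each stage yields a score in [0, 1] (1 = certainly slop), and we stop at the first decisive stage instead of paying for the costlier checks:

```python
from dataclasses import dataclass

@dataclass
class Report:
    # Hypothetical per-stage evidence scores in [0, 1]; 1 = certainly slop.
    content_score: float      # classifier score on the content itself
    author_score: float       # score over other content by the same author/site
    reporter_weighted: float  # the report, weighted by the reporter's reputation
    corroboration: float      # other reports, weighted by their mean reputation

def classify_report(report: Report, threshold: float = 0.9) -> str:
    """Walk through progressively costlier checks; stop at the first
    decisive stage, otherwise fall back to human review."""
    stages = [
        ("content", report.content_score),
        ("author", report.author_score),
        ("reporter", report.reporter_weighted),
        ("corroboration", report.corroboration),
    ]
    for name, score in stages:
        if score >= threshold:
            return f"slop ({name})"
        if score <= 1 - threshold:
            return f"not-slop ({name})"
    return "human-review"

# Content check is ambiguous, but the author's history is decisive:
print(classify_report(Report(0.6, 0.95, 0.5, 0.5)))  # → slop (author)
# Nothing decisive at any stage → escalate to a human:
print(classify_report(Report(0.5, 0.5, 0.5, 0.5)))   # → human-review
```

The design point is simply that the cheap, high-volume checks run first, and human review remains the backstop rather than the default.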
The user also suggested a voluntary community effort to review content. I think this is very prone to abuse. As mentioned earlier, there will always be someone trying to downrank competing results, farm karma points by reviewing blindly, etc. This is the sad reality of Campbell's law (any metric used for evaluation can become a target for abuse).
The good thing here is that, given Kagi is a paid service with a relatively small user base, such abuse is less likely to manifest, unless there is targeted or intentional trolling by someone willing to pay just to do that (or to be hired for it). As the user base grows, the service gains popularity, and/or the price drops, the risk increases.
So what am I recommending then?
Basically:
- Keep the 'Report as AI-generated' feature. It is genuinely empowering, even if imperfect.
- Require human review of reports to prevent abuse of the feature. If the volume is huge, ML assistance may help.
- Turning review into a communal effort (i.e., giving the community power to influence results) can be a really bad idea due to Campbell's law, especially as the user base grows and diversifies, since there will always be trolls, biased users, unethical marketers, karma seekers, etc.
- We expect Kagi to do a better job of filtering AI content out of search results, and believe it is in a favorable position to do so. Utilizing ML, as Vlad suggested, sounds like a great idea.
All the best.