Edgar Cervantes / Android Authority
TL;DR
- An Android Authority teardown has revealed that Reddit will use an AI model for detecting harassment.
- The model is trained on content that was previously flagged for violating Reddit’s terms.
We’ve seen large language models (LLMs) used for a variety of features in the last year or so, from text/image generation to virtual assistants and beyond. Now, it looks like we can add one more use case to the list thanks to Reddit.
An APK teardown helps predict features that may arrive on a service in the future based on work-in-progress code. However, it’s possible that such predicted features may not make it to a public release.
A teardown of version 2024.10.0 of the Reddit app for Android has revealed that Reddit is now using an LLM to detect harassment on the platform. You can view the relevant strings below.
<string name="hcf_answer_how_model_trained">The harassment model is an large language model (LLM) that's trained on content that our enforcement teams have found to be violating. Moderator actions are also an input in how the model is trained.</string>
<string name="hcf_faq_how_model_trained">How is the harassment model trained?</string>
Reddit also updated its help page a week ago to mention the use of an AI model as part of its harassment filter.
“The filter is powered by a Large Language Model (LLM) that’s trained on moderator actions and content removed by Reddit’s internal tools and enforcement teams,” reads an excerpt from the page.
Either way, it looks like moderators have another tool in their arsenal to fight objectionable content on Reddit. Will this actually do a great job of flagging content, though? We’ll just have to wait and see.