Breaking the Watchdog: How RAFT Generates Realistic Attacks to Fool AI Detectors
The release of Large Language Models (LLMs) like ChatGPT and LLaMA has fundamentally changed how we generate text. From writing emails to coding, the utility is undeniable. However, this power comes with a shadow: academic dishonesty, disinformation campaigns, and sophisticated phishing. To counter this, a new industry of “AI Detectors” has emerged—tools designed to distinguish between human and machine-written content. But how robust are these guardians? In this post, we dive deep into a paper titled “RAFT: Realistic Attacks to Fool Text Detectors,” which proposes a novel framework for “red-teaming” (attacking) these detectors. Unlike previous methods that often produce garbled or grammatically incorrect text, RAFT generates attacks that are essentially invisible to the human eye but confusing enough to break the best detectors available. ...