Generative AI (GenAI)

AI-generated text detector

  • perplexity: likelihood of sequence of text according to specific language model
  • bilingual evaluation understudy (BLEU): quality of machine translation compared to human translation

commercial:

paper:

testing:

  • Binoculars & GPTZero work well on my text & short prompt generation
    • ✅ classify paraphrase as human
    • ❓ classify long prompt generation as human
    • around 15GB × 2, 0.5s per context window (around 300 token) on A6000

Issues from AI-generated text

  • bot account on social media
  • fake product review
  • academic plagiarism
    • peer review validity

paper on AI-generated text:

  • Monitoring AI-Modified Content at Scale: A Case Study on the Impact of ChatGPT on AI Conference Peer Reviews, Weixin Liang, Zachary Izzo, Yaohui Zhang, Haley Lepp, Hancheng Cao, Xuandong Zhao, Lingjiao Chen, Haotian Ye, Sheng Liu, Zhi Huang, Daniel A. McFarland, James Y. Zou, ICML, 2024; Mapping the Increasing Use of LLMs in Scientific Papers, Weixin Liang, Yaohui Zhang, Zhengxuan Wu, Haley Lepp, Wenlong Ji, Xuandong Zhao, Hancheng Cao, Sheng Liu, Siyu He, Zhi Huang, Diyi Yang, Christopher Potts, Christopher D Manning, James Y. Zou, arXiv, 2024
    • vastly efficient statistical batch detection via MLE of adjective appearance, i.e., word choice
      • trained on human & ChatGPT-generated review from paper & prompt
      • ❓ what prompt for us? generalize to other model?
    • somewhat resistant to paraphrasing
    • inference on 40k review from ICLR 2024, NeurIPS 2023, CoRL 2023, EMNLP 2023
    • review w/ generation seem rushed, cite (use et al.) less
    • ❗ what would set us apart from them?
    • criticism
      • validation unscientific: AI dataset generated same way as training data
      • human may learn word from AI
  • The Rise of AI-Generated Content in Wikipedia, Creston Brooks, Samuel Eggert, Denis Peskoff, EMNLP workshop, 2024
    • use GPTZero & Binoculars w/ Falcon-7B
    • Wikipedia article before & after GPT-3.5
      • Reddit data from Kaggle
    • get lower bound of generation by subtracting previous positive rate
      • ❗ unscientific bc assume paper i.i.d.
    • people likely believe repeated statement
    • did not use Ghostbuster bc need training, Fast-DetectGPT bc lower accuracy
  • The AI Review Lottery: Widespread AI-Assisted Peer Reviews Boost Paper Scores and Acceptance Rates, Giuseppe Russo Latona, Manoel Horta Ribeiro, Tim R. Davidson, Veniamin Veselovsky, Robert West, arXiv, 2024
    • > 16% of 28,000 ICLR 2024 review used AI, w/ higher score
      • threat to peer review process
    • use GPTZero
    • 30 word that predict generation by ~70% accuracy
      • can such word use for screening

news:

  • Some of Substack’s Biggest Newsletters Rely On AI Writing Tools | WIRED
    • 10% of 25 Substack sample seem generated, even by prominent author
      • GPTZero paid to view article
      • ❗ really small sample size
    • some author claim to use LLM to polish not create
    • predict that badge asserting human-made will be common
  • AI Slop Is Flooding Medium | WIRED
    • 40% of post seem generated
      • done by Pangram & Originality separately
      • ❓ how do we compete w/ company
    • ∃ misinformation tracking company, e.g. NewsGuard, that scan for generated text
    • Medium CEO claim most generated post was hardly being read
      • argue generated post are bad writing
    • mention of YouTube get-rich-quick tutorial on generating post/book

Human detection of AI-generated content

Browser extension/add-on

display info on search result/ webpage itself

extension to automatically detect AI-generated text:

why extension for human to vote DNE:

  • user privacy
  • scale and cost
    • peer-to-peer?
  • adversarial attack like click farm
    • can require voter to verify & pass CAPTCHA
    • solution from crowdsourcing platform: moderator on Reddit, VIP on SponsorBlock
  • voting is subjective
    • specific tagging instead, e.g., “Ad”, “AI”, “Scam”
  • (abandoned) similar project Dissenter browser let user comment on any webpage