This Week in Responsible AI

July 22, 2024

Jailbreaks

  • OpenAI’s latest model will block the ‘ignore all previous instructions’ loophole
  • Refuse Whenever You Feel Unsafe: Improving Safety in LLMs via Decoupled Refusal Training

Privacy

  • Trust No Bot: Discovering Personal Disclosures in Human-LLM Conversations in the Wild (PDF)
  • The app that promised to ‘use AI to weed out daters with STIs’ has been shut down (CW: potentially NSFW)

Hype

  • Everyone Is Judging AI by These Tests. But Experts Say They’re Close to Meaningless
  • Large Models of What? Mistaking Engineering Achievements for Human Linguistic Agency
  • Questionable practices in machine learning
  • A Sanity Check on ‘Emergent Properties’ in Large Language Models

Social algorithms

  • We unleashed Facebook and Instagram’s algorithms on blank accounts. They served up sexism and misogyny
  • I Changed My Race to White on Hinge. And got better matches. Is the algorithm the problem, or the men?

Law/Policy

  • Meta will withhold multimodal AI models from the EU amid regulatory uncertainty
  • OpenAI Dropped From First Ever AI Programming Copyright Lawsuit
  • The AI Executive Order through the lens of the AI Index
  • Biden’s top tech adviser says AI is a ‘today problem’

Copying

  • Academic authors ‘shocked’ after Taylor & Francis sells access to their research to Microsoft AI
  • Figma explains how its AI tool ripped off Apple’s design
  • European Innovation Council: Artificial intelligence and copyright: use of generative AI tools to develop new content
  • Disney’s internal Slack was leaked by hackers mad about AI

Other

  • Want to spot a deepfake? Look for the stars in their eyes
  • Data workers detail exploitation by tech industry in DAIR report
  • AI and facial recognition tools in open-source intelligence
  • Selfie-based authentication raises eyebrows among infosec experts
  • AI AI Bias: Large Language Models Favor Their Own Generated Content
  • Sustainable AI: a contradiction in terms?

Compiled by Leif Hancox-Li
