Adversarial Examples that Fool both Computer Vision and Time-Limited Humans

Gamaleldin Fathy Elsayed

Shreya Shankar

Brian Cheung

Nicolas Papernot

Alex Kurakin

Ian Goodfellow

Jascha Sohl-dickstein

NeurIPS (2018)

Download Google Scholar

Abstract

Machine learning models are vulnerable to adversarial examples: small changes to images can cause computer vision models to make mistakes such as identifying a school bus as an ostrich. However, it is still an open question whether humans are prone to similar mistakes. Here, we address this question by leveraging recent techniques that transfer adversarial examples from computer vision models with known parameters and architecture to other models with unknown parameters and architecture, and by matching the initial processing of the human visual system. We find that adversarial examples that strongly transfer across computer vision models influence the classifications made by time-limited human observers.

Research Areas

Machine Perception
General Science

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations  & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Adversarial Examples that Fool both Computer Vision and Time-Limited Humans

Abstract

Research Areas

Meet the teams driving innovation

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Adversarial Examples that Fool both Computer Vision and Time-Limited Humans

Abstract

Research Areas

Meet the teams driving innovation

AI/ML Foundations  & Capabilities