Abstract
Analyses in the AI safety literature typically combine a risk or safety issue (e.g., interruptibility) with a particular paradigm for an AI agent (e.g., reinforcement learning). However, there is currently no survey of safety-relevant characteristics of AI systems that might reveal neglected areas of research or suggest to developers which design choices they could make to avoid or minimise certain safety concerns. In this paper, we take a first step towards such a survey, from two angles. The first covers AI system characteristics that are already known to be relevant to safety concerns, including internal system characteristics, characteristics relating to the effect of the external environment on the system, and characteristics relating to the effect of the system on the target environment. The second briefly surveys a broad range of AI system characteristics that could prove relevant to safety research, including types of interaction, computation, integration, anticipation, supervision, modification, motivation and achievement. This survey enables further work on exploring the system characteristics and design choices that affect safety concerns.