Isabella News - Clean European News

❌

Dismiss

🔖

Bookmark

OpenAI Blog • 8 years ago • EN

Open ↗

Domain randomization and generative models for robotic grasping

Log in to bookmark articles and create collections

❌

Dismiss

🔖

Bookmark

OpenAI Blog • 8 years ago • EN

Open ↗

Competitive self-play

We’ve found that self-play allows simulated AIs to discover physical skills like tackling, ducking, faking, kicking, catching, and diving for the ball, without explicitly designing an environment with these skills in mind. Self-play ensures that the environment is always the right difficulty for an...

Log in to bookmark articles and create collections

❌

Dismiss

🔖

Bookmark

OpenAI Blog • 8 years ago • EN

Open ↗

OpenAI Baselines: ACKTR & A2C

We’re releasing two new OpenAI Baselines implementations: ACKTR and A2C. A2C is a synchronous, deterministic variant of Asynchronous Advantage Actor Critic (A3C) which we’ve found gives equal performance. ACKTR is a more sample-efficient reinforcement learning algorithm than TRPO and A2C, and requir...

Log in to bookmark articles and create collections

❌

Dismiss

🔖

Bookmark

OpenAI Blog • 8 years ago • EN

Open ↗

Dota 2

We’ve created a bot which beats the world’s top professionals at 1v1 matches of Dota 2 under standard tournament rules. The bot learned the game from scratch by self-play, and does not use imitation learning or tree search. This is a step towards building AI systems which accomplish well-defined goa...

Log in to bookmark articles and create collections

❌

Dismiss

🔖

Bookmark

OpenAI Blog • 8 years ago • EN

Open ↗

Gathering human feedback

RL-Teacher is an open-source implementation of our interface to train AIs via occasional human feedback rather than hand-crafted reward functions. The underlying technique was developed as a step towards safe AI systems, but also applies to reinforcement learning problems with rewards that are hard...

Log in to bookmark articles and create collections

❌

Dismiss

🔖

Bookmark

OpenAI Blog • 8 years ago • EN

Open ↗

Proximal Policy Optimization

We’re releasing a new class of reinforcement learning algorithms, Proximal Policy Optimization (PPO), which perform comparably or better than state-of-the-art approaches while being much simpler to implement and tune. PPO has become the default reinforcement learning algorithm at OpenAI because of i...

Log in to bookmark articles and create collections

❌

Dismiss

🔖

Bookmark

OpenAI Blog • 8 years ago • EN

Open ↗

Robust adversarial inputs

We’ve created images that reliably fool neural network classifiers when viewed from varied scales and perspectives. This challenges a claim from last week that self-driving cars would be hard to trick maliciously since they capture images from multiple scales, angles, perspectives, and the like.

Log in to bookmark articles and create collections

❌

Dismiss

🔖

Bookmark

OpenAI Blog • 8 years ago • EN

Open ↗

Learning from human preferences

One step towards building safe AI systems is to remove the need for humans to write goal functions, since using a simple proxy for a complex goal, or getting the complex goal a bit wrong, can lead to undesirable and even dangerous behavior. In collaboration with DeepMind’s safety team, we’ve develop...

Log in to bookmark articles and create collections

❌

Dismiss

🔖

Bookmark

OpenAI Blog • 8 years ago • EN

Open ↗

Learning to cooperate, compete, and communicate

Multiagent environments where agents compete for resources are stepping stones on the path to AGI. Multiagent environments have two useful properties: first, there is a natural curriculum—the difficulty of the environment is determined by the skill of your competitors (and if you’re competing agains...

Log in to bookmark articles and create collections

❌

Dismiss

🔖

Bookmark

OpenAI Blog • 8 years ago • EN

Open ↗

OpenAI Baselines: DQN

We’re open-sourcing OpenAI Baselines, our internal effort to reproduce reinforcement learning algorithms with performance on par with published results. We’ll release the algorithms over upcoming months; today’s release includes DQN and three of its variants.

Log in to bookmark articles and create collections

❌

Dismiss

🔖

Bookmark

OpenAI Blog • 8 years ago • EN

Open ↗

Robots that learn

We’ve created a robotics system, trained entirely in simulation and deployed on a physical robot, which can learn a new task after seeing it done once.

Log in to bookmark articles and create collections

❌

Dismiss

🔖

Bookmark

OpenAI Blog • 8 years ago • EN

Open ↗

Roboschool

We are releasing Roboschool: open-source software for robot simulation, integrated with OpenAI Gym.

Log in to bookmark articles and create collections

❌

Dismiss

🔖

Bookmark

OpenAI Blog • 8 years ago • EN

Open ↗

Unsupervised sentiment neuron

We’ve developed an unsupervised system which learns an excellent representation of sentiment, despite being trained only to predict the next character in the text of Amazon reviews.

Log in to bookmark articles and create collections

❌

Dismiss

🔖

Bookmark

OpenAI Blog • 8 years ago • EN

Open ↗

Spam detection in the physical world

We’ve created the world’s first Spam-detecting AI trained entirely in simulation and deployed on a physical robot.

Log in to bookmark articles and create collections

❌

Dismiss

🔖

Bookmark

OpenAI Blog • 8 years ago • EN

Open ↗

Distill

We’re excited to support today’s launch of Distill, a new kind of journal aimed at excellent communication of machine learning results (novel or existing).

Log in to bookmark articles and create collections

❌

Dismiss

🔖

Bookmark

OpenAI Blog • 8 years ago • EN

Open ↗

Learning to communicate

In this post we’ll outline new OpenAI research in which agents develop their own language.

Log in to bookmark articles and create collections

❌

Dismiss

🔖

Bookmark

OpenAI Blog • 8 years ago • EN

Open ↗

Team update

The OpenAI team is now 45 people. Together, we’re pushing the frontier of AI capabilities—whether by validating novel ideas, creating new software systems, or deploying machine learning on robots.

Log in to bookmark articles and create collections

❌

Dismiss

🔖

Bookmark

OpenAI Blog • 9 years ago • EN

Open ↗

Faulty reward functions in the wild

Reinforcement learning algorithms can break in surprising, counterintuitive ways. In this post we’ll explore one failure mode, which is where you misspecify your reward function.

Log in to bookmark articles and create collections

❌

Dismiss

🔖

Bookmark

OpenAI Blog • 9 years ago • EN

Open ↗

Universe

We’re releasing Universe, a software platform for measuring and training an AI’s general intelligence across the world’s supply of games, websites and other applications.

Log in to bookmark articles and create collections

❌

Dismiss

🔖

Bookmark

OpenAI Blog • 9 years ago • EN

Open ↗

OpenAI and Microsoft

We’re working with Microsoft to start running most of our large-scale experiments on Azure.

Log in to bookmark articles and create collections

❌

Dismiss

🔖

Bookmark

OpenAI Blog • 9 years ago • EN

Open ↗

Semi-supervised knowledge transfer for deep learning from private training data

Log in to bookmark articles and create collections

❌

Dismiss

🔖

Bookmark

OpenAI Blog • 9 years ago • EN

Open ↗

Report from the self-organizing conference

Last week we hosted over a hundred and fifty AI practitioners in our offices for our first self-organizing conference on machine learning.

Log in to bookmark articles and create collections

❌

Dismiss

🔖

Bookmark

OpenAI Blog • 9 years ago • EN

Open ↗

Machine Learning Unconference

The latest information about the Unconference is now available at the Unconference wiki, which will be periodically updated with more information for attendees.

Log in to bookmark articles and create collections

❌

Dismiss

🔖

Bookmark

OpenAI Blog • 9 years ago • EN

Open ↗

Concrete AI safety problems

We (along with researchers from Berkeley and Stanford) are co-authors on today’s paper led by Google Brain researchers, Concrete Problems in AI Safety. The paper explores many research problems around ensuring that modern machine learning systems operate as intended.

Log in to bookmark articles and create collections

❌

Dismiss

🔖

Bookmark

OpenAI Blog • 9 years ago • EN

Open ↗

OpenAI technical goals

OpenAI’s mission is to build safe AI, and ensure AI’s benefits are as widely and evenly distributed as possible.

Log in to bookmark articles and create collections

❌

Dismiss

🔖

Bookmark

OpenAI Blog • 9 years ago • EN

Open ↗

Team update

We’d like to welcome the latest set of team members to OpenAI (and we’re still hiring!)

Log in to bookmark articles and create collections

❌

Dismiss

🔖

Bookmark

OpenAI Blog • 9 years ago • EN

Open ↗

Adversarial training methods for semi-supervised text classification

Log in to bookmark articles and create collections

❌

Dismiss

🔖

Bookmark

OpenAI Blog • 9 years ago • EN

Open ↗

OpenAI Gym Beta

We’re releasing the public beta of OpenAI Gym, a toolkit for developing and comparing reinforcement learning (RL) algorithms. It consists of a growing suite of environments (from simulated robots to Atari games), and a site for comparing and reproducing results.

Log in to bookmark articles and create collections

❌

Dismiss

🔖

Bookmark

OpenAI Blog • 9 years ago • EN

Open ↗

Weight normalization: A simple reparameterization to accelerate training of deep neural networks

Log in to bookmark articles and create collections

❌

Dismiss

🔖

Bookmark

OpenAI Blog • 10 years ago • EN

Open ↗

Introducing OpenAI

OpenAI is a non-profit artificial intelligence research company. Our goal is to advance digital intelligence in the way that is most likely to benefit humanity as a whole, unconstrained by a need to generate financial return. Since our research is free from financial obligations, we can better focus...

Log in to bookmark articles and create collections

Isabella's AI-Powered News Aggregator

Sentiment Analysis

Domain randomization and generative models for robotic grasping

Competitive self-play

OpenAI Baselines: ACKTR & A2C

Dota 2

Gathering human feedback

Proximal Policy Optimization

Robust adversarial inputs

Learning from human preferences

Learning to cooperate, compete, and communicate

OpenAI Baselines: DQN

Robots that learn

Roboschool

Unsupervised sentiment neuron

Spam detection in the physical world

Distill

Learning to communicate

Team update

Faulty reward functions in the wild

Universe

OpenAI and Microsoft

Semi-supervised knowledge transfer for deep learning from private training data

Report from the self-organizing conference

Machine Learning Unconference

Concrete AI safety problems

OpenAI technical goals

Team update

Adversarial training methods for semi-supervised text classification

OpenAI Gym Beta

Weight normalization: A simple reparameterization to accelerate training of deep neural networks

Introducing OpenAI

🔥 Trending Topics

💡 Popular Searches

Sentiment Analysis

🔥 Trending Topics

💡 Popular Searches

Install Isabella News

Share Article

Article Title