Retro Contest
Weβre launching a transfer learning contest that measures a reinforcement learning algorithmβs ability to generalize from previous experience.
Log in to bookmark articles and create collections
AI-Powered Learning bringing you YOUR best news
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Weβre launching a transfer learning contest that measures a reinforcement learning algorithmβs ability to generalize from previous experience.
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
On March 3rd, we hosted our firstΒ hackathonΒ with 100 members of the artificial intelligence community.
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Weβve developed a simple meta-learning algorithm called Reptile which works by repeatedly sampling a task, performing stochastic gradient descent on it, and updating the initial parameters towards the final parameters learned on that task. Reptile is the application of the Shortest Descent algorithm...
Log in to bookmark articles and create collections
Weβre providing 6β10 stipends and mentorship to individuals from underrepresented groups to study deep learning full-time for 3 months and open-source a project.
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Weβre releasing eight simulated robotics environments and a Baselines implementation of Hindsight Experience Replay, all developed for our research over the past year. Weβve used these environments to train models which work on physical robots. Weβre also releasing a set of requests for robotics res...
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Come to OpenAIβs office in San Franciscoβs Mission District for talks and a hackathon on Saturday, March 3rd.
Log in to bookmark articles and create collections
Weβre excited to welcome new donors to OpenAI.
Log in to bookmark articles and create collections
Weβve co-authored a paper that forecasts how malicious actors could misuse AI technology, and potential ways we can prevent and mitigate these threats. This paper is the outcome of almost a year of sustained work with our colleagues at the Future of Humanity Institute, the Centre for the Study of Ex...
Log in to bookmark articles and create collections
Weβve designed a method that encourages AIs to teach each other with examples that also make sense to humans. Our approach automatically selects the most informative examples to teach a conceptβfor instance, the best images to describe the concept of dogsβand experimentally we found our approach to...
Log in to bookmark articles and create collections
Weβve built a system for automatically figuring out which object is meant by a word by having a neural network decide if the word belongs to each of about 100 automatically-discovered βtypesβ (non-exclusive categories).
Log in to bookmark articles and create collections
Weβre releasing a new batch ofΒ seven unsolved problemsΒ which have come up in the course of our research at OpenAI.
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Weβre releasing highly-optimized GPU kernels for an underexplored class of neural network architectures: networks with block-sparse weights. Depending on the chosen sparsity, these kernels can run orders of magnitude faster than cuBLAS or cuSPARSE. Weβve used them to attain state-of-the-art results...
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Weβve developed a hierarchical reinforcement learning algorithm that learns high-level actions useful for solving a range of tasks, allowing fast solving of tasks requiring thousands of timesteps. Our algorithm, when applied to a set of navigation problems, discovers a set of high-level actions for...
Log in to bookmark articles and create collections
Our latest robotics techniques allow robot controllers, trained entirely in simulation and deployed on physical robots, to react to unplanned changes in the environment as they solve simple tasks. That is, weβve used these techniques to build closed-loop systems rather than open-loop ones as before.
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Weβve found that self-play allows simulated AIs to discover physical skills like tackling, ducking, faking, kicking, catching, and diving for the ball, without explicitly designing an environment with these skills in mind. Self-play ensures that the environment is always the right difficulty for an...
Log in to bookmark articles and create collections
We show that for the task of simulated robot wrestling, a meta-learning agent can learn to quickly defeat a stronger non-meta-learning agent, and also show that the meta-learning agent can adapt to physical malfunction.
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections
Weβre releasing an algorithm which accounts for the fact that other agents are learning too, and discovers self-interested yet collaborative strategies like tit-for-tat in the iterated prisonerβs dilemma. This algorithm, Learning with Opponent-Learning Awareness (LOLA), is a small step towards agent...
Log in to bookmark articles and create collections
Log in to bookmark articles and create collections