Isabella's AI-Powered News Aggregator

AI-Powered Learning bringing you YOUR best news

Sentiment Analysis

Positive: 1282
Negative: 473
Neutral: 790

OpenAI Blog • EN

Learning from human preferences

One step towards building safe AI systems is to remove the need for humans to write goal functions, since using a simple proxy for a complex goal, or getting the complex goal a bit wrong, can lead to undesirable and even dangerous behavior. In collaboration with DeepMind’s safety team, we’ve develop...

OpenAI Blog • EN

Learning to cooperate, compete, and communicate

Multiagent environments where agents compete for resources are stepping stones on the path to AGI. Multiagent environments have two useful properties: first, there is a natural curriculum - the difficulty of the environment is determined by the skill of your competitors (and if you’re competing agains...

OpenAI Blog • EN

OpenAI Baselines: DQN

We’re open-sourcing OpenAI Baselines, our internal effort to reproduce reinforcement learning algorithms with performance on par with published results. We’ll release the algorithms over upcoming months; today’s release includes DQN and three of its variants.

OpenAI Blog • EN

Robots that learn

We’ve created a robotics system, trained entirely in simulation and deployed on a physical robot, which can learn a new task after seeing it done once.

OpenAI Blog • EN

Roboschool

We are releasing Roboschool: open-source software for robot simulation, integrated with OpenAI Gym.

OpenAI Blog • EN

Unsupervised sentiment neuron

We’ve developed an unsupervised system which learns an excellent representation of sentiment, despite being trained only to predict the next character in the text of Amazon reviews.

OpenAI Blog • EN

Spam detection in the physical world

We’ve created the world’s first Spam-detecting AI trained entirely in simulation and deployed on a physical robot.

OpenAI Blog • EN

Evolution strategies as a scalable alternative to reinforcement learning

We’ve discovered that evolution strategies (ES), an optimization technique that’s been known for decades, rivals the performance of standard reinforcement learning (RL) techniques on modern RL benchmarks (e.g. Atari/MuJoCo), while overcoming many of RL’s inconveniences.

OpenAI Blog • EN

Distill

We’re excited to support today’s launch of Distill, a new kind of journal aimed at excellent communication of machine learning results (novel or existing).

OpenAI Blog • EN

Learning to communicate

In this post we’ll outline new OpenAI research in which agents develop their own language.

OpenAI Blog • EN

Attacking machine learning with adversarial examples

Adversarial examples are inputs to machine learning models that an attacker has intentionally designed to cause the model to make a mistake; they’re like optical illusions for machines. In this post we’ll show how adversarial examples work across different mediums, and will discuss why securing syst...

OpenAI Blog • EN

Team update

The OpenAI team is now 45 people. Together, we’re pushing the frontier of AI capabilities, whether by validating novel ideas, creating new software systems, or deploying machine learning on robots.

OpenAI Blog • EN

Faulty reward functions in the wild

Reinforcement learning algorithms can break in surprising, counterintuitive ways. In this post we’ll explore one failure mode, which is where you misspecify your reward function.

OpenAI Blog • EN

Universe

We’re releasing Universe, a software platform for measuring and training an AI’s general intelligence across the world’s supply of games, websites and other applications.

OpenAI Blog • EN

OpenAI and Microsoft

We’re working with Microsoft to start running most of our large-scale experiments on Azure.
