site:www.cs.utexas.edu

www.cs.utexas.edu13d

Computer Science Doctoral Student Earns Google Fellowship

Mina Huh, a computer science Ph.D. student at UT Austin, has been awarded a Google Ph.D. Fellowship, the company announced on ...

www.cs.utexas.edu3d

Data-Efficient Policy Evaluation Through Behavior Policy Search

We consider the task of evaluating a policy for a Markov decision process (MDP).The standard unbiased technique for evaluating a policy is to deploy the policyand observe its performance. We show that ...

www.cs.utexas.edu8d

Overlapping Layered Learning

Patrick MacAlpine and Peter Stone.

www.cs.utexas.edu10d

E. Allen Emerson

E. Allen Emerson has a longstanding interest in formal methods for establishing program correctness. This was inspired in part by reading in the mid-1970's a CACM paper by Tony Hoare "Proof of Program ...

www.cs.utexas.edu10d

David Harwath

My research interests are in the area of machine learning for speech, language, and sound processing. I am particularly interested in multimodality and unsupervised ...

www.cs.utexas.edu8d

Transfer Learning for Reinforcement Learning Domains: A Survey

Transfer Learning for Reinforcement Learning Domains: A Survey. Matthew E. Taylor and Peter Stone. Journal of Machine Learning Research, 10(1):1633–1685, 2009.

www.cs.utexas.edu8d

Gaussian processes for sample efficient reinforcement learning with RMAX-like exploration

Gaussian processes for sample efficient reinforcement learning with RMAX-like exploration. Tobias Jung and Peter Stone. @InProceedings{ECML10-jung, author = "Tobias Jung and Peter Stone", title = ...

www.cs.utexas.edu8d

Multiagent Traffic Management: A Reservation-Based Intersection Control Mechanism

Multiagent Traffic Management: A Reservation-Based Intersection Control Mechanism. Kurt Dresner and Peter Stone. In The Third International Joint Conference on Autonomous Agents and Multiagent Systems ...

www.cs.utexas.edu8d

TEXPLORE: Real-Time Sample-Efficient Reinforcement Learning for Robots

TEXPLORE: Real-Time Sample-Efficient Reinforcement Learning for Robots. Todd Hester and Peter Stone. Machine Learning, 90(3):385–429, 2013.

www.cs.utexas.edu8d

Design and Optimization of an Omnidirectional Humanoid Walk:A Winning Approach at the RoboCup 2011 3D Simulation Competition

Design and Optimization of an Omnidirectional Humanoid Walk:A Winning Approach at the RoboCup 2011 3D Simulation Competition. Patrick MacAlpine, Samuel Barrett, Daniel Urieli, Victor Vu, and Peter ...

www.cs.utexas.edu8d

To Teach or not to Teach? Decision Making Under Uncertainty in Ad Hoc Teams

To Teach or not to Teach? Decision Making Under Uncertainty in Ad Hoc Teams. Peter Stone and Sarit Kraus. In The Ninth International Conference on Autonomous Agents and Multiagent Systems (AAMAS), ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results