Blog | LEMUR

19 Mar

ORCAS Update

This week I spent a lot of time fixing stability bugs with the ORCAS system and gathering data with the reinforcement learning system. With a learning system, stability is extremely important as learning can takes days or even weeks which means that any small issue can completely bring down the whol...

12 Mar

Learning How to Walk (or Fly)

Prathyush Katukojwala

This week the ORCAS team finished the communication layer between Gazebo and the reinforcement learning algorithm. With this, we were able to start training robots. We created a parameterized four legged robot and starting training a dense neural network (DNN) controller for it. Video In this sim...

05 Mar

Upcoming RoCo Paper Deadlines

Prathyush Katukojwala

The RoCo team is planning to submit two papers with deadlines in early April. The plan is to submit a paper on an interface for the co-design of mechanical and electro-functional structures to UIST 2018 and a paper on a unified design environment for multiple fabrication output forms for SCF 2018...

26 Feb

Comparison of Co-Optimization Algorithms

Prathyush Katukojwala

A recent blog post by Christian outlines an algorithm that we formulated for the co-optimization of structure and control of robots. This algorithm seeks to more tightly couple the co-optimization as compared to the algorithm described in [1]. Let's call the algorithm we propose Algorithm 1. Algo...

07 Feb

Learning How to Learn

Prathyush Katukojwala

Pictured: A simulation of the learning algorithm in a CartPole-v0 environment provided by OpenAI gym. This last week I started playing around with Tensorflow for reinforcement learning. I was able to implement a policy gradient with parameter-based exploration(PGPE) algorithm for learning a policy...

06 Nov

Squashing the Serialization Bug

Prathyush Katukojwala

This last weekend, I was finally able to fix the bug that had been eluding me for the past two weeks. Click here to read my previous blog post about this bug. My approach to fixing the bug was to painstakingly keeping track of the reference states of the different variables after each deserializ...

02 Nov

Through the Debugging Rabbit Hole (Weekly Update)

Prathyush Katukojwala

I spent the majority of this past week debugging a single issue in RoCo. We noticed that when we constrained subcomponent parameters in RoCo and rebuilt the component, the drawing would not reflect the constraint. The representation of the faces in RoCo are parameterized with Variables that define t...

26 Oct

Re-establishing the Hierarchy (Weekly Update)

Prathyush Katukojwala

My main progress this week was getting hierarchical composition to finally work again. There were issues in the old robot compiler backend that seemed to have sorted themselves out when we rewrote it. The main issue I had to deal with to get hierarchical composition working again was fixing yaml ser...

11 Oct

Weekly Update 10/11/17 - Prathyush

Prathyush Katukojwala

This week I worked on fixing and documenting bugs in RoCo to prepare it for an upcoming handover to a new team. I will be moving on to a new project later this quarter so in addition to working on RoCo, I performed a literature search for the current state of the art of reinforcement learning and le...

22 Jun

A Review of the Past Year

Prathyush Katukojwala

At the end of last year I had a constraint solving solution that did not scale well with complexity. It quickly became obvious that it would not suffice. Therefore, I began this year by tackling that problem once more. After trying a few more methods, we decided that the best solution would be to si...

LEMUR

ORCAS Update

Learning How to Walk (or Fly)

Upcoming RoCo Paper Deadlines

Comparison of Co-Optimization Algorithms

Learning How to Learn

Squashing the Serialization Bug

Through the Debugging Rabbit Hole (Weekly Update)

Re-establishing the Hierarchy (Weekly Update)

Weekly Update 10/11/17 - Prathyush

A Review of the Past Year

Search

Popular Tags