Ah, la recherche! Du temps perdu.
(source)
This is a summary of my general research interests, along with some highlights of recent activity. (updated: January 2011)
My first love is estimation and filtering. At Caltech I was infected with people’s enthusiasm on differential geometry.
When you apply geometry to estimation you get information geometry (Wikipedia summary, serious introduction).
When you apply estimation to geometry you get intrinsic estimation (see for example the theory of shape spaces, and my favorite paper).
I’m interested in both combinations (and both will provide a lifetime learning experience).
My main interest lies in robotics, which, from my point of view, includes all interactions of an intelligent agent with the real world. I believe that in my lifetime robotics will lead to some form of strong AI, and I hope to retire as a robopsychologist. While I’m waiting for that to happen, I’m particularly interested in perception problems.
I am fascinated by cybernetics, in its original meaning of comparative study of artificial and neural information processing: nature’s solutions are often much more robust than the current state of the art.
Simple is better than complicated. In robotics applications, this means staying close to the measurement space.
Formal is is better than informal. Also, “formal” usually implies “simple”, because you usually can prove something only if it is simple enough.
Data before models, as long as formal results can still be proved.
Open is better than closed. Reproducible research is a worthy goal. I predict that, in 15 years, it will be unthinkable to publish in engineering research or computational sciences without publishing the full source code.
Keep reading for some highlights from my research, or read my publications list.
My thesis work will focus on the problem of bootstrapping: learning a model of a robot’s sensorimotor cascade, and its environment, from zero prior information other than basic semantic. At the beginning, we only assume to have uninterpreted streams of bits representing the output of some sensors, and that we have some commands available, and that there is some causal relation from actions to observations. The situation is very similar to the video on the right.
Other than a worthy problem per se (it subsumes most problems of learning, calibration, fault detection, etc.), it is also a proxy for studying some aspects of the higher level of neural processing. In fact, it is believed that the cortex starts as a tabula rasa that adapts to the inputs (the evidence is that parts of it can be repurposed in subjects that lost some sensory capacity).
At this point, it is not clear if an agent can learn everything from the environment, or if there is something that should be known a priori. My goal has been to try to derive some precise formulation of the problem, and to find strong results (in the spirit of control theory) that can be built upon.
Representative papers:
(other preprints are undergoing double-blind reviews and cannot be published on a website; do not hesitate to ask for them if you are interested)
Video: The video shows learning of a bilinear model of a sensorimotor cascade for a camera. The agent starts with no previous knowledge on the sensor geometry, and by correlating observations with commands, it can learn a generative model for the data. The same model can be used for learning the dynamics of different sensors (range-finder, camera, field sampler). See many other videos of related experiments.
I have been collaborating with the Dickinson lab, in particular with Andrew Straw (who now has his own lab at the Institute of Molecular Pathology in Vienna, Austria) on the quantitative characterization of visually-driven behavior in Drosophila Melanogaster.
The video shows the kind of simulated data I am working with. Using a special apparatus with 11 cameras, the animal is tracked with a precision of a few millimiters. An ad-hoc simulator allows to reconstruct the visual stimulus experienced by the animal at a reasonable level of accuracy.
We then analyze the data asking whether we can infer the neural processing happening in the animal brain.
Video: Simulated visual input from tracking data of free-flying Drosophila Melanogaster (watch on Vimeo). On the right, you can see the simulated visual stimulus. On the left, the position of the fly in the arena (1m radius). Note that the video is slowed down about 10x with respect to the real data: fruit flies are very fast! (real time is in the left corner) Watch the video full-screen to appreciate the details.
(still needs update)
See also:
The ultimate match: Cantor meets Kalman!
On the performance of Kalman filtering with intermittent observations: a geometric approach with fractals
How to reach a consensus if you and your friends are a bunch of spiking neurons.
Real-valued average consensus over noisy quantized channels
How precise can a localization method be?
The achievable accuracy for range-finder localization Journal Version
How precise can a scan-matching method be?
On achievable accuracy for pose tracking
How precise is ICP?
An accurate closed-form estimate of ICP’s covariance
Global localization without the worries and anxiety of filtering.
Lazy Localization using the Frozen-Time Smoother
PLICP converges quadratically in a finite number of steps.
An ICP variant using a point-to-line metric
A global complete algorithm for scan matching.
Scan matching in the Hough domain
HSM in 3D: much more difficult!
HSM3D: Feature-Less Global 6DOF Scan-Matching in the Hough/Radon Domain
GPM, which tries to make the most of the odometry model.