I am a Senior Researcher at Idiap Research Institute, where I lead the Audio Inference Group. We work generally in artificial intelligence and particularly in speech processing on the conjunction of recognition and synthesis. In each case, we try to focus on physiologically plausible and statistically rigorous systems. Here is a short bio.
A (possibly incomplete) list of my patents. They are mainly US, but in practice most of these began life as either GB or Japan patents; the US ones are just easier to reference.
Other technical reports that are not commercial in confidence or published elsewhere. The list includes some MPEG submissions that made it into MPEG-7.
Most work is on my GitHub page.
- For a while now I’ve been trying to find a sustainable way of maintaining software for the group. BuSLR was a failed attempt, but does contain some useful conda recipes. Right now, the most promising approach is homebrew, for which I maintain a tap.
- SSP is a signal processing library in python. It contains, amongst other things, a feature extractor and continuous pitch estimator. libssp duplicates some functionality in a C++ library.
- ISS is/was a framework for building and evaluating ASR and TTS models. Based around HTS it’s quite old.
- I wrote and (infrequently) maintain Tracter, a data-flow framework that formed the feature acquisition mechanism of Juicer. Tracter wraps several other packages, including libssp.
Look at the username above and add
@idiap.ch. It’ll probably get
here. Otherwise try the
Idiap contact page.