This page contains selected Open Source software projects created by the Knowledge Technology Group.
Source code of Ball Localization CNN - Paper (RoboCup, 2016)
This is the source code (Python) for the paper "Ball Localization for Robocup Soccer using Convolutional Neural Networks", which proposes a new approach for localizing the ball in RoboCup humanoid soccer. A deep neural architecture is used without any preprocessing at all. The localization part gets solved by a convolutional neural network that is trained with probability distributions on full images. No sliding-window approach is used. The paper was written in the context of the Bachelor Thesis of Daniel Speck. The paper won the Best Paper Award for Engineering Contribution at the 20th Annual RoboCup International Symposium 2016 in Leipzig, Germany.
D. Speck, P. Barros, C. Weber, S. Wermter (2016) Ball Localization for Robocup Soccer using Convolutional Neural Networks. RoboCup 2016: Robot World Cup XX.
Contact:Pablo Barros, Cornelius Weber, Stefan Wermter
Domain- and Cloud-based Knowledge for Speech Recognition - DOCKS
Google, Apple, Bing, and similar services offer very good and easily retrievable cloud-based automated speech recognition (ASR) for many languages and are taking advantage of constant improvements on the server side. However, these ASR systems cannot be adapted with domain knowledge (e.g. by restricting the recognizer to a fixed vocabulary, grammar, or statistical language model) which may result in poor application performance.
To address this gap, the Knowledge Technology group developed DOCKS (Domain- and Cloud-based Knowledge for Speech recognition), which combines the well trained acoustic models of the cloud services with domain-specific knowledge. It has been shown that DOCKS can improve the results of a cloud-based system like Google speech significantly if the domain can be restricted.
Contact:Johannes Twiefel, MSc, Dipl.-Inform. Stefan Heinrich,
Professor Dr. Stefan Wermter, Dr. Timo Baumann Dipl.-Ing. Erik Strahl
Source code of Syntactic Reservoir Model (PLoS ONE, 2013)
This is the source code (Python) given in supplementary material of our
PLoS ONE paper of 2013. It enables to reproduce experiments of the paper
and gives a full access to the model: you will be able to modify all the
parameters and change the model. It uses Python libraries like Numpy,
and also uses the Oger toolbox developed within the EU FP7 Organic
X. Hinaut, P.F. Dominey (2013) Real-Time Parallel Processing of Grammatical Structure in the Fronto-Striatal System: A Recurrent Network Simulation Study Using Reservoir Computing. PloS ONE 8(2): e52946.