Holger Arndt

Your Expert for Big Data Analytics, Machine Learning, Artificial Intelligence, Neural Networks and Text Mining

As a freelance project manager and data scientist, I develop solutions for data-intensive problems and provide expert knowledge in machine learning, big data analytics, artificial intelligence, deep learning, data mining and text mining.



My Services

You can hire me as a project manager, data scientist, software developer or consultant.

I regularly work with start-up companies and therefore know the particular challenges and difficulties of this environment. As a coach or consultant, I can help you develop your "big data" idea into a sound business plan and turn it into reality.

Project Management

  • Project and team management (certified PMP®)
  • Interim CTO
  • Coaching and consulting
  • Agile software development, Scrum, Kanban

Data Analysis

  • Data mining for biomedical data (InnoCentive award in 2008)
  • Data mining for customer relationship management systems
  • Analysis of social networks
  • Analysis of unstructured and semi-structured data (text)
  • Automatic email analysis
  • Social media monitoring
  • Portfolio analysis
  • Churn analysis
  • Sentiment analysis
  • Fraud detection
  • Risk analysis

Consulting

  • Development of business intelligence strategies
  • Data mining and reporting solutions
  • Analysis of cross-selling opportunities
  • Optimization of marketing strategies
  • Cost estimation for data mining projects

Software Development

  • Definition of software architectures
  • Scaling for big data
  • Information extraction and information retrieval systems
  • Development of triple store databases from unstructured data (Text2RDF)
  • Implementation of software solutions, e.g. for biomedical problems
  • Decision support systems
  • Systems for non-standard BI solutions


Expertise

Machine Learning

Artificial neural networks, deep learning, decision trees, linear/logistic regression, support vector machines (SVM), KNN, bagging, boosting, random forest

Cluster Analysis

K-Means, hierarchical clustering, self-organizing map (SOM), DBSCAN, t-SNE

Graphical Models

Bayesian networks, naïve Bayes, conditional random fields (CRF), hidden Markov models (HMM), statistics

Text Mining

Named entity recognition, relation extraction, topic models, semantic web, part-of-speech tagging, ontologies, Apache Lucene, Elasticsearch

Optimization Methods

Simulated annealing, genetic algorithms, ant colony optimization

Programming Languages

Java, C, C++, C#, .NET, WPF, Matlab, PHP, JavaScript, Perl, Bash, Basic, Pascal, LaTeX

Operating Systems

Windows, Linux, Xen, VMware, Amazon AWS, VirtualBox

Databases

MySQL, MariaDB, PostgreSQL, Microsoft SQL Server, Oracle, SQLite, HSQLDB, Derby, MS Access, ORM, Hibernate

Web Technologies

HTML5, CSS3, XML, jQuery, Apache, Tomcat, Jetty, Bootstrap, REST, SOAP, Axis, AJAX, WordPress, Google AdWords, Google Analytics, Piwik

Communication Protocols

TCP/IP, EOBD, OBD-II, ISO15765-4 CAN, ISO14229 UDS, ISO14230-4 KWP2000, ISO9141-2, KWP1281, SAE-J1850, SAE-J2411, RS-232

Project Management

PMP®, Scrum, Kanban, Jira, Confluence, Mantis, Zendesk, Word, Excel, PowerPoint, Project

Development Tools

Eclipse, Visual Studio, IntelliJ, PhpStorm, CVS, SVN, Git, Maven, Selenium

Languages

German (native language), English (professional working proficiency), French (elementary proficiency), Italian (elementary proficiency), Spanish (elementary proficiency)


Projects

Jun 2010 – today

autoaid GmbH, Berlin
Position: CTO
Objective: Development of a self-learning diagnostic system for motor vehicles
Technologies: machine learning, neural networks, text mining, Elasticsearch, CAN, microcontrollers
www.autoaid.de
Sep 2009 – May 2010

autoaid GmbH, Berlin
Position: Consultant and software developer
Objective: Development of a self-learning diagnostic system for motor vehicles
Technologies: machine learning, neural networks, text mining
www.autoaid.de
Aug 2008 – Nov 2009

Refined Labs GmbH, Munich
Position: Software Engineer Data Analytics
Objective: Development of an automatic bid management system
Technologies: Java, Linux, MySQL, SVN, machine learning, optimization methods
www.refinedlabs.com
Sep 2007 – Aug 2008

Siemens AG, Munich
Position: Data Scientist and Software Developer
Objective: Development of a web-based platform for text analysis
Technologies: named entity recognition, relation extraction, topic models, Bayesian networks, graphical models
www.siemens.de
Sep 2006 – Aug 2007

Siemens AG, Munich
Position: Data Scientist and Software Developer
Objective: Improvement of a web-based platform for analysis and maintenance of biomedical data
Technologies: web services, Axis, SOAP, Lucene, ontologies
www.siemens.de
Jun 2007 – Jul 2007

Hospimed International, Regensburg
Position: Software Developer and Consultant
Objective: Maintenance of servers and databases
Technologies: Linux, Apache, virtualization, SQL, Java, PHP, Perl
www.hospimed.de
Sep 2005 – Aug 2006

Siemens AG, Munich
Position: Data Scientist and Software Developer
Objective: Development of a web-based platform for analysis and maintenance of biomedical data
Technologies: Java, Tomcat, HTML, XML, MySQL, PostgreSQL, ORM, Hibernate
www.siemens.de
May 2006 – Jul 2007

ENETX eBusiness oHG, Regensburg
Position: Software Developer and Consultant
Objective: Maintenance of website and databases
Technologies: HTML, SQL, PHP
www.enetx.de
Jul 2003 – Aug 2005

Siemens AG, Munich
Position: Software Developer and Data Scientist
Objective: Various projects in computational neuroscience, implementation of algorithms and visualization tools
Technologies: Java, Matlab, C, C++, neural networks, machine learning, clustering
www.siemens.de


Publications

Jul 2010: Semi-Automatic Assignment of Work Items (Full Paper)
Jonas Helming, Holger Arndt, Zardosht Hodaie, Maximilian Koegel, Nitesh Narayan
5th International Conference on Evaluation of Novel Approaches to Software Engineering (ENASE), Athens, Greece
Jun 2010: Universal Java Matrix Package (Workshop Abstract and Poster)
Holger Arndt
Workshop on Machine Learning Open Source Software, 27th International Conference on Machine Learning (ICML), Haifa, Israel
May 2010: From Informal Project Management Artifacts to Formal System Models (Workshop Paper)
Jonas Helming, Nitesh Narayan, Holger Arndt, Maximilian Koegel, Walid Maalej
Flexitools Workshop, International Conference on Software Engineering (ICSE), Cape Town, South Africa
Jul 2009: Towards a Next-Generation Matrix Library for Java (Full Paper)
Holger Arndt, Markus Bundschus, Andreas Nägele
33rd Annual IEEE International Computer Software and Applications Conference (COMPSAC), Seattle, USA
Jul 2009: The Java Data Mining Package – A Data Processing Library for Java (Abstract and Poster)
Holger Arndt
33rd Annual IEEE International Computer Software and Applications Conference (COMPSAC), Seattle, USA
Jun 2008: Large-Scale Genetic Network Learning (Full Paper)
Andreas Nägele, Holger Arndt, Mathäus Dejori
18th European Conference on Artificial Intelligence (ECAI)
Jul 2006: Developing Gestural Input (Workshop Paper)
Matthias Kranz, Stefan Freund, Paul Holleis, Albrecht Schmidt, Holger Arndt
6th International Workshop on Smart Appliances and Wearable Computing (IWSAWC)
May 2003: Parallelimplementierung von rekurrenten selbstorganisierenden Karten [Parallel Implementation of Recurrent Self-Organizing Maps] (Diploma Thesis)
Holger Arndt
Lehrstuhl für Kommunikationsnetze, Technische Universität München


Software

As I'm often confronted with problems that cannot be solved with standard methods, I have developed my own toolboxes for linear algebra, data analysis and machine learning. These software packages for Java have been released under an open source license and are now used by companies and research institutes all over the world. Commercial use is explicitly permitted and free of charge.

Universal Java Matrix Package

The Universal Java Matrix Package (UJMP) is an open source library for dense and sparse matrix computations and linear algebra in Java. In addition to the basic operations like matrix multiplication, matrix inverse or matrix decomposition, it also supports visualization, JDBC import/export and many other useful functions such as mean, correlation, standard deviation, mutual information, or the replacement of missing values.

It's a Swiss Army knife for data processing in Java, tailored to machine learning applications.
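To give a rough idea of what some of the statistics mentioned above involve, here is a minimal sketch in plain Java that does not use UJMP at all. The class `ColumnStats` and its method names are hypothetical and chosen only for illustration; they are not the UJMP API. The sketch assumes that missing values are encoded as `NaN` and that they are replaced by the column mean, which is only one of several possible strategies.

```java
import java.util.Arrays;

/** Hypothetical helper (not part of UJMP): basic statistics on plain double arrays. */
final class ColumnStats {

    /** Arithmetic mean of all non-NaN entries; NaN if there are none. */
    static double mean(double[] values) {
        double sum = 0.0;
        int count = 0;
        for (double v : values) {
            if (!Double.isNaN(v)) {
                sum += v;
                count++;
            }
        }
        return count == 0 ? Double.NaN : sum / count;
    }

    /** Returns a copy in which every NaN (assumed to mean "missing") is replaced by the mean. */
    static double[] replaceMissingWithMean(double[] values) {
        double m = mean(values);
        double[] result = Arrays.copyOf(values, values.length);
        for (int i = 0; i < result.length; i++) {
            if (Double.isNaN(result[i])) {
                result[i] = m;
            }
        }
        return result;
    }

    /** Sample standard deviation (n - 1 denominator); expects at least two values and no NaN. */
    static double standardDeviation(double[] values) {
        double m = mean(values);
        double sumOfSquares = 0.0;
        for (double v : values) {
            sumOfSquares += (v - m) * (v - m);
        }
        return Math.sqrt(sumOfSquares / (values.length - 1));
    }

    /** Pearson correlation of two equally long columns that contain no NaN. */
    static double correlation(double[] x, double[] y) {
        if (x.length != y.length) {
            throw new IllegalArgumentException("columns must have the same length");
        }
        double mx = mean(x);
        double my = mean(y);
        double sxy = 0.0, sxx = 0.0, syy = 0.0;
        for (int i = 0; i < x.length; i++) {
            sxy += (x[i] - mx) * (y[i] - my);
            sxx += (x[i] - mx) * (x[i] - mx);
            syy += (y[i] - my) * (y[i] - my);
        }
        return sxy / Math.sqrt(sxx * syy);
    }

    public static void main(String[] args) {
        // Fill the gap in x first, then compute statistics on the completed column.
        double[] x = replaceMissingWithMean(new double[] {1.0, 2.0, Double.NaN, 4.0});
        double[] y = {2.0, 4.0, 5.0, 8.0};
        System.out.println("mean(x) = " + mean(x));
        System.out.println("sd(x)   = " + standardDeviation(x));
        System.out.println("corr    = " + correlation(x, y));
    }
}
```

A matrix library bundles operations of this kind behind a common matrix type, so they can be combined with multiplication, inversion and decomposition without converting between data structures.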


Java Data Mining Package

The Java Data Mining Package (JDMP) is an open source Java library for data analysis and machine learning. It facilitates access to data sources and machine learning algorithms (e.g. clustering, regression, classification, graphical models, optimization) and provides visualization modules. JDMP includes a number of algorithms and tools of its own and also offers interfaces to other machine learning and data mining packages (Weka, LibLinear, Elasticsearch, LibSVM, Mallet, Lucene, Octave).



Contact

I look forward to your project request and am open to interesting challenges, whether in Munich, Berlin, elsewhere in Germany or abroad: mail @ holger - arndt . com