Holger Arndt
Your Expert for Big Data Analytics, Machine Learning, Artificial Intelligence, Neural Networks and Text Mining
As a freelance project manager and data scientist, I develop solutions for data-intensive problems and provide expert knowledge in machine learning, big data analytics, artificial intelligence, deep learning and data mining or text mining.
Project Request Info for Recruiter
My Services
You can hire me as a project manager, data scientist, software developer or consultant.
I regularly work with start up companies and thus know the peculiar challenges and difficulties in this environment. As a coach or consultant, I can help you to elaborate your "big data" idea into a sound business plan and turn it into reality.
Project Management
- Project- and team management (certified PMP®)
- Interim CTO
- Coaching and consulting
- Agile software development, Scrum, Kanban
Data Analysis
- Data mining for biomedical data (InnoCentive award in 2008)
- Data mining for customer relationship management systems
- Analysis of social networks
- Analysis of unstructured and semi-structured data (Text)
- Automatic email analysis
- Social media monitoring
- Portfolio analysis
- Churn analysis
- Sentiment analysis
- Fraud detection
- Risk analysis
Consulting
- Development of business intelligence strategies
- Data mining and reporting solutions
- Analysis of cross-selling opportunities
- Optimization of marketing strategies
- Cost estimation for data mining projects
Software Development
- Definition of software architectures
- Scaling for big data
- Information extraction and information retrieval systems
- Development of triple store data bases from unstructured data (Text2RDF)
- Implementation of software solutions e.g. for bio-medical problems
- Decision support systems
- Systems for non-standard BI solutions
Expertise
Machine Learning
Artificial neural networks, deep learning, decision trees, liniear/logistic regression, support vector machine (SVM), KNN, bagging, boosting, random forest
Cluster Analysis
K-Means, hierarchical clustering, self-organizing map (SOM), DBSCAN, t-SNE
Graphical Models
Bayesian networks, naïve bayes, conditional random fields (CRF), hidden markov models (HMM), statistics
Text Mining
Named entity recognition, relation extraction, topic models, semantic web, part-of-speech tagging, ontologies, Apache Lucene, Elasticsearch
Optimization Methods
Simulated annealing, genetic algoritms, ant colony optimization
Programming Languages
Java, C, C++, C#, .NET, WPF, Matlab, PHP, JavaScript, Perl, Bash, Basic, Pascal, LaTex
Operating Systems
Windows, Linux, Xen, VMWare, Amazon AWS, VirtualBox
Databases
MySQL, MariaDB, PostgreSQL, Microsoft SQL Server, Oracle, SQLite, HSQLDB, Derby, MS Access, ORM, Hibernate
Web Technologies
HTML5, CSS3, XML, jQuery, Apache, Tomcat, Jetty, Bootstrap, Rest, SOAP, Axis, AJAX, WordPress, Google Adwords, Google Analytics, Piwik
Communication Protocols
TCP/IP, EOBD, OBD-II, ISO15765-4 CAN, ISO14229 UDS, ISO14230-4 KWP2000, ISO9141-2, KWP1281, SAE-J1850, SAE-J2411, RS-232
Project Management
PMP®, Scrum, Kanban, Jira, Confluence, Mantis, ZenDesk, Word, Excel, Powerpoint, Project
Development Tools
Eclipse, Visual Studio, IntelliJ, PhpStorm, CVS, SVN, Git, Maven, Selenium
Languages
German (mother language), English (professional working proficiency), French (elementary proficiency), Italian (elementary proficiency), Spanish (elementary proficiency)
Projects
Jun 2010 – today |
autoaid GmbH, Berlin Position: CTO Objective: Development of a self-learning diagnostic system for motor vehicles Technologies: machine learning neural networks text mining Elasticsearch CAN micro controllers www.autoaid.de |
Sep 2009 – May 2010 |
autoaid GmbH, Berlin
Position: Consultant and software developer Aufgabe: Development of a self-learning diagnostic system for motor vehicles Technologies: machine learning neural networks text mining www.autoaid.de |
Aug 2008 – Nov 2009 |
Refined Labs GmbH, Munich Position: Software Engineer Data Analytics Objective: Development of an automatic bid management system Technologies: Java Linux MySQL SVN machine learning optimization methods www.refinedlabs.com |
Sep 2007 – Aug 2008 |
Siemens AG, Munich Position: Data Scientist and Software Developer Objective: Development of a web-based platform for text analysis Technologies: named entity recognition relation extraction topic models Bayesian networks graphical models www.siemens.de |
Sep 2006 – Aug 2007 |
Siemens AG, Munich Position: Data Scientist and Software Developer Objective: Improvement of a web-based platform for analysis and maintenance of biomedical data Technologies: web services Axis SOAP Lucene ontologies www.siemens.de |
Jun 2007 – Jul 2007 |
Hospimed International, Regensburg Position: Software Developer and Consultant Objective: Maintenance of servers and databases Technologies: Linux Apache virtualization SQL Java PHP Perl www.hospimed.de |
Sep 2005 – Aug 2006 |
Siemens AG, Munich Position: Data Scientist and Software Developer Objective: Development of a web-based platform for analysis and maintenance of biomedical data Technologies: Java Tomcat HTML XML MySQL PostgreSQL ORM Hibernate www.siemens.de |
May 2006 – Jul 2007 |
ENETX eBusiness oHG, Regensburg Position: Software Developer and Consultant Objective: Maintenance of web site and databases Technologies: HTML SQL PHP www.enetx.de |
Jul 2003 – Aug 2005 |
Siemens AG, Munich Position: Software Developer and Data Scientist Objective: Various projects in Computational Neuroscience, implementation of algorithms and visualization tools Technologies: Java Matlab C C++ neural networks machine learning clustering www.siemens.de |
Publications
Jul 2010 |
Semi-Automatic Assignment of Work Items
Full Paper
Jonas Helming, Holger Arndt, Zardosht Hodaie, Maximilian Koegel, Nitesh Narayan 5th International Conference on Evaluation of Novel Approaches to Software Engineering (ENASE) , Athens, Greece Download |
Jun 2010 |
Universal Java Matrix Package
Workshop Abstract and Poster
Holger Arndt Workshop on Machine Learning Open Source Software, 27th International Conference on Machine Learning (ICML) , Haifa, Israel Download |
May 2010 |
From Informal Project Management Artifacts to Formal System Models
Workshop Paper
Jonas Helming, Nitesh Narayan, Holger Arndt, Maximilian Koegel, Walid Maleej Flexitools Workshop, International Conference for Software Engineering (ICSE) , Capetown, South Africa Download |
Jul 2009 |
Towards a Next-Generation Matrix Library for Java
Full Paper
Holger Arndt, Markus Bundschus, Andreas Nägele 33rd Annual IEEE International Computer Software and Applications Conference (COMPSAC) , Seattle, USA Download |
Jul 2009 |
The Java Data Mining Package – A Data Processing Library for Java
Abstract and Poster
Holger Arndt 33rd Annual IEEE International Computer Software and Applications Conference (COMPSAC) , Seattle, USA Download |
Jun 2008 |
Large-Scale Genetic Network Learning
Full Paper
Andreas Nägele, Holger Arndt, Mathäus Dejori 18th European Conference on Artificial Intelligence (ECAI) |
Jul 2006 |
Developing Gestural Input
Workshop Paper
Matthias Kranz, Stefan Freund, Paul Holleis, Albrecht Schmidt, Holger Arndt 6th International Workshop on Smart Appliances and Wearable Computing (IWSAWC) Download |
May 2003 |
Parallelimplementierung von rekurrenten selbstorganisierenden Karten
Diplomarbeit
Holger Arndt Lehrstuhl für Kommunikationsnetze , Technische Universität München Download |
Software
As I'm often confronted with problems which cannot be solved with standard methods, I have developed my own toolboxes for linear algebra, data analysis and machine learning. These software packages for Java have been released under an Open Source license and are now being used by companies and research institutes from all over the world. Commercial use is explicitly permitted and free of charge.
Universal Java Matrix Package
The Universal Java Matrix Package (UJMP) is an open source library for dense and sparse matrix computations and linear algebra in Java. In addition to the basic operations like matrix multiplication, matrix inverse or matrix decomposition, it also supports visualization, JDBC import/export and many other useful functions such as mean, correlation, standard deviation, mutual information, or the replacement of missing values.
It's a swiss army knife for data processing in Java, tailored to machine learning applications.
Java Data Mining Package
The Java Data Mining Package (JDMP) is an open source Java library for data analysis and machine learning. It facilitates the access to data sources and machine learning algorithms (e.g. clustering, regression, classification, graphical models, optimization) and provides visualization modules. JDMP provides a number of algorithms and tools, but also interfaces to other machine learning and data mining packages (Weka, LibLinear, Elasticsearch, LibSVM, Mallet, Lucene, Octave).
How-To
This is a collection of practical tips for common tasks for programming and server configuration (mostly for Linux).
Apt Pinning
Configure Apache Web Server
Configure Default Editor for Debian
Configure sudo
Create SSL Certificate
Encode Video with mencoder
Encrypt Disk Using cryptsetup
Git Tricks
Important Browser Plugins and Extensions
Install Elasticsearch
Install GlusterFS
Install KVM
Install Samba File Server
Install Squid Proxy Server
Install SVN
Maven Tricks
MySQL Tricks
PDF Tricks
Raid How-To Using mdadm
Setup Ansible
Setup iptables Firewall
Links
Big Data and Machine Learning Links
Google books Ngram Viewer Huge database of 1-grams to 5-grams ordered by language
NLP-Tools List of software and datasets for natural language processing
Linked Data Connect knowledge and data on the Semantic Web using URIs and RDF
Development
TCPDF Library for creating PDFs in PHP
Barcode JQuery Library for creating barcodes
Misc Links
AnyDesk Alternative to TeamViewer for remote desktop sessions
xkcd One of the best comics on the web
Urban Dictionary English slang dictionary
MusicBrainz Open music encyclopedia for music metadata
Publish or Perish Software to analyze acedemic publications and calculate impact factor
Paletton Find matching color palettes
Escape The City Job opportunities that are more exciting and different
Contact
I'm looking forward to your project request being open for interesting challenges, whether in Munich, Berlin, Germany or abroad: mail @ holger - arndt . com