This opportunity is based in Zurich

Information extraction from structured and unstructured documents


Companies handle a high volume of unstructured documents such as letters, forms or ID scans on a daily basis. Considerable manual work goes into extracting from these documents the information needed to perform a business process (dates, contract numbers, addresses, etc.). This internship will focus on automatically extracting information from those unstructured and structured documents. To do so, the student will design and build machine learning and deep learning pipelines, using state-of-the-art libraries. The final goal is to integrate those pipelines into one of our existing NLP services.

In this role

The goal of this internship is to:

  • Build machine learning and deep learning pipelines to extract information from documents written in English, with the possibility of extending them to other languages (e.g. French and German). The intern will be responsible for pre-processing the documents, selecting features, and building and evaluating models.
  • Integrate the best models into one of our existing products. The student will have the opportunity to sharpen their software development skills.

What we offer

Join our team as an intern and you will find a young, dynamic and culturally diverse working environment.

About your profile

  • Interest in and strong knowledge of Machine Learning and Natural Language Processing
  • Programming language: Python 3+
  • Deployment: Knowing Docker is a plus

If you are interested in applying for this position, please send us your complete application (CV, cover letter, letter of reference, diplomas and certificates).
