This opportunity is based in Lausanne

Active Learning Annotation Tool (Internship)

Apply

Description  

While the frontiers in natural language processing (NLP) research are rapidly expanding, it is still a great challenge to develop customized NLP solutions in the real world. Properly annotated data is scarce, making it impossible to train or even finetune data-hungry algorithms. In particular for named entity recognition (NER), accurate annotations are crucial but very expensive to gather. Even more so in the 3 Swiss national languages.

Active learning methods can mitigate the problem by facilitating the annotation task and reducing the work effort required by domain experts.

Within the scope of this internship, the student will leverage an active learning approach to develop an efficient annotation tool and integrate it into one of our existing NLP products. In addition, the application of NER to data anonymization will be studied and developed. This application is becoming rapidly pervasive with all data analytics services that require a certain level of privacy and security. Existing libraries have very good performance when it comes to NER for English documents, however for other languages, the performance drops, often drastically. Data anonymization systems need to rely on highly performant extraction models, with minimal leakage and having such a system

In this role

Objectives  

The goal of the internship is to:

  • Study and compare state-of-the-art active learning and named entity recognition (NER) methods. The student will have the opportunity to deepen their knowledge in these two fields.
  • Design and implement an active learning annotation tool to annotate text corpora for NER tasks. The intern will be able to sharpen their software development and ML engineering skills.
  • Integrate the tool into one of our existing products. The student will be responsible for the productification of their solution.

What we offer

INTERNSHIP in Lausanne. Join our team as intern and you will find a young, dynamic and culturally diverse working environment.

About your profile

  • Interest and strong knowledge in Machine Learning and Natural Language Processing
  • Experience with Python (NumPy stack, common ML and NLP libraries)
  • Experience with Git, Docker and REST/microservices is a plus

If you are INTERESTED in applying for this position, please send us your complete application (CV, cover letter, letter of reference, diplomas and certificates).

By continuing to browse this site, you accept the use of cookies or similar technologies whose purpose is to produce statistics on visits to our site (tests and measurement of visitor numbers, visit frequency, page views and performance) and to offer you content and promotions which will be of interest to you.

Our cookie policy has been updated. Feel free to manage your preferences.

close
save

Manage your cookie preferences

Update your cookie preferences

Find out about the type of cookies stored on your device, accept or block them for the entire site, all services or on a service-by-service basis.

OK, accept all

Visitor flow

These cookies provide us with insight into traffic sources and allow us to better understand our visitors anonymously.

(Google Analytics and CrazyEgg)

New

Sharing tool

Social media cookies allow content sharing on your preferred networks.

(ShareThis)

New

Visitor understanding

These cookies are used to track visitors across websites.

The intention is to enable us to offer more relevant, targeted content to existing contacts (ClickDimensions) and display ads that are relevant and engaging for users (Facebook Pixels).

 

New
For more information about these cookies and our cookie policy, click here