Language tagging for NLP
On several occasions we worked with a leading natural language processing (NLP) company that needed to label a huge corpus of texts in Italian and English.

Qabiria’s approach to understanding our needs ensured a flawless experience from start to finish. With Qabiria, it is not just about achieving the desired results, but exceeding expectations. We highly recommend their services to anyone seeking uncompromising professionalism and results-oriented solutions.

500
thousand words
8
linguists involved
3
weeks
About Leader in natural language processing
Provider of tools and services for annotating corpora and datasets, both at the word and phrase level.
Case study summary
Industry | IT and software |
---|---|
Services | Consulting |
Languages | Italian, English |
The challenge
Current language services go beyond just translation. In Qabiria we are used to tackling projects that straddle different disciplines.
In 2020, we were contacted by a leading company in natural language processing (NLP) that needed to label a corpus of texts, taken from conversations on a social network, that had various languages mixed together. Labeling, or tagging, involved identifying the languages, parts of speech, and functions performed by each element.
The project was particularly complex for several reasons: first, the client’s very tight timelines. Under pressure from the final client, a large international group, the project of about 500,000 words had to be completed in just 3 weeks.
Secondly, we were required to use brat rapid annotation tool, a specific structured annotation tool, which we had never used until then. In this case, it was sufficient to connect via a browser to the brat instance on the client’s servers to access the assigned tasks and perform them in collaborative mode.
Finally, the presence of other languages, in addition to Italian, entailed an additional complication, as one had to distinguish the use of simple Anglicisms, which are very frequent in Italian, from the parts properly in other languages.
The solution
To cope with the client’s urgency, we prepared in a few days a team of 8 linguists who worked in a coordinated way. After training them on the activity to be done and the system to be used, we assigned each person his or her part and set up a system to keep track of the hours spent.
Qabiria has also assigned a dedicated project manager to this position. This person maintained contact with the client, monitored adherence to timelines, responded to linguists' concerns, and oversaw the entire project.
The result
After completing a first part of the project on time and in the required manner, the client expressed full satisfaction and sent other similar tagging work in the following months.
To tackle these new jobs, we have further optimized processes and significantly reduced delivery times, while still maintaining the same level of quality.
Further Reading
Services provided to Leader in natural language processing
English translations
We translated the marketing texts of Leader in natural language processing into English.
Learn more about our English translation servicesSpanish translations
We translated the marketing texts of Leader in natural language processing into Spanish.
Learn more about our Spanish translation servicesWeb Localisation
We helped Leader in natural language processing internationalize their e-commerce.
Learn more about our web localisation servicesGet the same results as Leader in natural language processing!
Let us know what you need by sending an email to hola@qabiria.com or by filling in the contact form. We guarantee a response within 24 hours, but usually we’re much faster.
Contact us