35 min

Silo AI: Why Körber, Philips or Unilever work with them? Industrial AI Podcast

    • Technology

Silo AI and TurkuNLP are developing a family of multilingual open-source LLMs to strengthen Europe's digital sovereignty and democratise access to LLMs. Central to this effort are baseline models aligned with European values, built on data that accurately represents the languages, citizens, organisations and cultural landscape of the European Union. This approach also enables sovereignty over how downstream applications and value are created.

The success was attributed to combining the low-resource Finnish language with resource-rich languages. The team worked to determine the optimal frequency of data reuse for low-resource languages during training, and integrated translated English-Finnish text pairs. This strategy, which relies on a cross-lingual signal to improve the model's understanding of the relationships between the languages, proved crucial to achieving excellent performance in the low-resource languages without compromising performance in English.

