In recent years, many groups have used the transformer architecture, a deep learning model, to train neural networks on large quantities of text. With increasing compute power, these models have grown to billions or even hundreds of billions of parameters. As model size grew, noteworthy abilities emerged, such as generating text that shows surprising reasoning skills, to the point that the leading models can now successfully pass college-level exams.
Currently, some of the best and most famous models are proprietary and offered to the public as a service. However, a large open-source community has emerged that trains and fine-tunes free models that can be self-hosted. This is a challenging task due to potential copyright issues with the training text, the large computational cost of the training itself, and the supervised fine-tuning step needed to adapt a model to its final use case.
In this talk I will give an overview of the most promising projects in this space and how they compare to the proprietary state-of-the-art models from the large players.