Chinese company DeepSeek launched its generative AI ‘r1 model’ in January. Run and primarily funded by Liang Wenfeng, a billionaire and former trader, DeepSeek focuses on novel AI research. DeepSeek has published scientific papers detailing the technological advances used in its ‘Large Language Model’ (LLM) r1 and released the weights of the model open source.
The model uses less energy to run and process the large amounts of human-created data required to ‘train’ it. For what was at that point a relatively unknown company, having to work around the embargo on the export of the chips needed to produce LLMs, r1 performed at a level comparable to the massive LLMs made by tech giants such as Google, OpenAI and others; and at a much cheaper cost.
Read more