Hangzhou DeepSeek Artificial Intelligence Basic Technology Research Co., Ltd., doing business as DeepSeek, is a Chinese artificial intelligence company that develops large language models (LLMs).
The training cost of its models was reported to be significantly lower than that of other LLMs. The company claims that it trained its V3 model for US$6 million, far less than the US$100 million cost for OpenAI’s GPT-4 in 2023, using approximately one-tenth the computing power consumed by Meta’s comparable model, Llama 3.1. DeepSeek’s success against larger and more established rivals has been described as “upending AI”.