MIT Large Language Model

MIT researchers use large language models to flag problems in complex systems

Researchers used large language models to efficiently detect anomalies in time-series data, without the need for costly and cumbersome training steps. This method could someday help alert technicians ...

SiliconANGLE

DeepSeek releases improved V3 model under MIT license

DeepSeek today released an improved version of its DeepSeek-V3 large language model under a new open-source license. Software developer and blogger Simon Willison was first to report the update.

VentureBeat

Beyond static AI: MIT's new framework lets models teach themselves

Researchers at MIT have developed a framework called Self-Adapting Language Models (SEAL) that enables large language models (LLMs) to continuously learn and adapt by updating their own internal ...

techtimes

Large Language Model Limitations: Why Generative AI Still Has a Long Way to Go, Researchers Say

As great as generative AI looks, researchers at Harvard, MIT, the University of Chicago, and Cornell concluded that LLMs are not as reliable as we believe. Even a big company like Nintendo did not ...

MIT Technology Review

Does AI know too much?

The growing field of machine unlearning aims to make large language models forget harmful information without retraining them ...

Virtualization Review

Large Language Model Selection -- Why the Parameter Count Isn't Everything

When choosing a large language model (LLM) for use in a particular task, one of the first things that people often look at is the model's parameter count. A vendor might offer several different ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results