SK Hynix, Samsung and Micron shares fell on investor fears that fewer memory chips may be required in the future.
Nvidia's KV Cache Transform Coding (KVTC) compresses the LLM key-value cache by 20x without model changes, cutting GPU memory costs and reducing time-to-first-token by up to 8x for multi-turn AI applications.
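The paper itself isn't excerpted here, but the general shape of transform coding a KV cache can be sketched. The following Python snippet is a minimal illustration, not Nvidia's actual KVTC pipeline: it applies a DCT along the head dimension of a synthetic key block, keeps the largest coefficients, and rounds them to int8. The shapes, the keep_ratio knob, and the helper names are all assumptions for illustration.

import numpy as np
from scipy.fft import dct, idct

def compress_kv(block, keep_ratio=0.25):
    # Decorrelate along the head dimension, keep the largest coefficients,
    # then round-to-nearest int8 with a single absmax scale.
    coeffs = dct(block, axis=-1, norm="ortho")
    k = max(1, int(keep_ratio * coeffs.shape[-1]))
    idx = np.argsort(-np.abs(coeffs), axis=-1)[..., :k]
    kept = np.take_along_axis(coeffs, idx, axis=-1)
    scale = max(float(np.abs(kept).max()) / 127.0, 1e-8)
    q = np.clip(np.round(kept / scale), -127, 127).astype(np.int8)
    return q, idx, scale, coeffs.shape

def decompress_kv(q, idx, scale, shape):
    coeffs = np.zeros(shape, dtype=np.float32)
    np.put_along_axis(coeffs, idx, q.astype(np.float32) * scale, axis=-1)
    return idct(coeffs, axis=-1, norm="ortho")

# Correlated synthetic "keys" of shape (tokens, head_dim); real K/V
# activations are similarly compressible, pure noise would not be.
kv = np.cumsum(np.random.randn(16, 128).astype(np.float32), axis=-1)
q, idx, scale, shape = compress_kv(kv)
rec = decompress_kv(q, idx, scale, shape)
print("relative error:", np.linalg.norm(kv - rec) / np.linalg.norm(kv))

A real codec would entropy-code the coefficients and their positions rather than storing raw indices as this sketch does; the point is only the transform-then-quantize structure that "transform coding" names.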
However, Alphabet's latest breakthrough may have changed that notion, which has triggered a sell-off in Micron's (MU 0.49%) ...
A team of researchers led by California Institute of Technology computer scientist and mathematician Babak Hassibi says it ...
Google Research recently revealed TurboQuant, a compression algorithm that reduces the memory footprint of large language ...
Recognition memory research encompasses a diverse range of models and decision processes that characterise how individuals differentiate between previously encountered stimuli and novel items. At the ...
TurboQuant tackles the hidden memory problem that's been limiting your local LLMs
A paper from Google could make local LLMs even easier to run.
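Neither snippet reproduces TurboQuant's actual algorithm, so here is a rough Python illustration of how quantization shrinks an LLM's memory footprint in general: a generic per-channel int8 round-to-nearest scheme, explicitly not Google's method. The matrix shape and function names are assumptions.

import numpy as np

def quantize_per_channel(w):
    # One float32 scale per output channel; values stored as int8.
    scales = np.abs(w).max(axis=1, keepdims=True) / 127.0
    scales[scales == 0] = 1.0  # avoid divide-by-zero on all-zero rows
    q = np.clip(np.round(w / scales), -127, 127).astype(np.int8)
    return q, scales.astype(np.float32)

def dequantize(q, scales):
    return q.astype(np.float32) * scales

w = np.random.randn(4096, 4096).astype(np.float32)
q, s = quantize_per_channel(w)
print("fp32 MiB:", w.nbytes / 2**20, "int8 MiB:", (q.nbytes + s.nbytes) / 2**20)
print("max abs error:", float(np.abs(w - dequantize(q, s)).max()))

Storing int8 values plus one scale per channel cuts memory roughly 4x versus float32, which is the kind of footprint reduction that makes local LLMs easier to run.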
Memory is no longer just supporting infrastructure; it has become a primary determinant of system performance, cost and ...
What if your AI could remember every meaningful detail of a conversation—just like a trusted friend or a skilled professional? In 2025, this isn’t a futuristic dream; it’s the reality of ...
Memory models offer the formal frameworks that define how operations on memory are executed in environments with concurrent processes. By establishing rules for the ordering and visibility of memory ...
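The snippet is cut off, but the kind of rule such a model pins down can be stated formally. One classic example, chosen for illustration rather than taken from the source, is Lamport's sequential consistency: an execution is sequentially consistent iff there is a total order $<_S$ on all memory operations such that
$$ o_1 \xrightarrow{\mathrm{po}} o_2 \;\Rightarrow\; o_1 <_S o_2, \qquad \mathrm{val}\bigl(r(x)\bigr) = \mathrm{val}\Bigl(\max_{<_S}\{\, w(x) \mid w(x) <_S r(x) \,\}\Bigr), $$
i.e. $<_S$ respects each thread's program order (po), and every read of a location $x$ returns the value of the most recent write to $x$ in that order. Weaker models relax exactly these ordering and visibility guarantees.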
Researchers at the Tokyo-based startup Sakana AI have developed a new technique that enables language models to use memory more efficiently, helping enterprises cut the costs of building applications ...
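The snippet doesn't detail Sakana AI's technique, so as a rough Python illustration of the general idea of spending LLM memory more efficiently, here is a generic KV-cache eviction sketch: keep only the cached tokens that receive the most attention. The scoring rule, shapes, and names are assumptions, not Sakana's method.

import numpy as np

def prune_kv_cache(keys, values, attn, keep):
    # Score each cached token by the total attention it received from
    # recent queries, then evict everything outside the top `keep`.
    importance = attn.sum(axis=0)
    keep_idx = np.sort(np.argsort(-importance)[:keep])  # preserve token order
    return keys[keep_idx], values[keep_idx]

keys = np.random.randn(512, 128).astype(np.float32)
values = np.random.randn(512, 128).astype(np.float32)
attn = np.random.rand(32, 512).astype(np.float32)  # (recent queries, cached tokens)
k2, v2 = prune_kv_cache(keys, values, attn, keep=128)
print(k2.shape, v2.shape)  # (128, 128) each: a 4x smaller cache

Evicting three quarters of the cache this way trades a little recall of old context for a proportional cut in serving memory, which is the cost lever the article describes.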