Transformers are the cornerstone of the ...
For a while now, we’ve been talking about transformers, the neural network architecture behind today’s frontier models, as a transformative technology, no pun intended. But now, these attention-based architectures have other competing ...
NVIDIA has released Nemotron 3 Nano, a hybrid Mamba-MoE model designed to cut inference costs by 60% and accelerate agentic ...
Recently, we talked to Dan Fu and Tri Dao – authors of “Hungry Hungry Hippos” (aka “H3”) – on our Deep Papers podcast. H3 is a proposed language modeling architecture that performs comparably to ...
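To make the idea concrete, here is a minimal sketch, not taken from the H3 paper or the podcast, of the kind of linear state-space recurrence that SSM-based architectures build on; the diagonal transition and all dimensions are illustrative assumptions.

```python
import numpy as np

# Minimal sketch (illustrative, not H3's actual implementation) of a diagonal
# linear state-space recurrence:
#   h_t = A * h_{t-1} + B @ x_t,   y_t = C @ h_t
# Unlike attention, each step only touches a fixed-size hidden state.

def ssm_scan(x, A, B, C):
    """x: (seq_len, d_in); A: (d_state,) diagonal; B: (d_state, d_in); C: (d_out, d_state)."""
    h = np.zeros(A.shape[0])
    ys = []
    for x_t in x:                      # sequential scan over the input
        h = A * h + B @ x_t            # update the fixed-size state
        ys.append(C @ h)               # read out an output for this step
    return np.stack(ys)

# Toy usage with hypothetical dimensions.
rng = np.random.default_rng(0)
x = rng.normal(size=(16, 8))
A = rng.uniform(0.5, 0.99, size=32)    # stable diagonal transition
B = rng.normal(size=(32, 8)) * 0.1
C = rng.normal(size=(4, 32)) * 0.1
print(ssm_scan(x, A, B, C).shape)      # (16, 4)
```

The point of the sketch is that each step updates only a fixed-size state rather than attending over all previous tokens, which is what lets SSM-style models sidestep attention's quadratic cost over sequence length.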
NVIDIA makes much of the hardware used to build AI models, but it’s now creating some very capable models ...
Google DeepMind published a research paper that proposes a language model called RecurrentGemma, which can match or exceed the performance of transformer-based models while being more memory efficient, ...
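As a rough illustration of that memory-efficiency claim, and not something drawn from the paper itself, the sketch below compares how a transformer's key-value cache grows with sequence length against a fixed-size recurrent state; the layer counts and dimensions are hypothetical.

```python
# Illustrative sketch: why a fixed-size recurrent state can be more memory
# efficient than a transformer's KV cache. All numbers are hypothetical and
# only show how the two memory footprints scale with sequence length.

def kv_cache_floats(seq_len, n_layers, n_heads, head_dim):
    # A transformer caches keys and values for every past token at every layer.
    return 2 * seq_len * n_layers * n_heads * head_dim

def recurrent_state_floats(n_layers, state_dim):
    # A recurrent model keeps one fixed-size state per layer, regardless of length.
    return n_layers * state_dim

for seq_len in (1_024, 8_192, 65_536):
    kv = kv_cache_floats(seq_len, n_layers=32, n_heads=16, head_dim=128)
    rec = recurrent_state_floats(n_layers=32, state_dim=4_096)
    print(f"seq_len={seq_len:>6}: KV cache ~{kv:,} floats vs. recurrent state ~{rec:,} floats")
```

The recurrent state stays the same size no matter how long the sequence gets, which is the core of the memory argument for recurrent architectures like this one.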