Financial Market Applications of LLMs

The AI revolution drove frenzied investment in both private and public companies and captured the public’s imagination in 2023. Transformational consumer products like ChatGPT are powered by Large Language Models (LLMs) that excel at modeling sequences of tokens that represent wo … | Continue reading


@thegradient.pub | 8 days ago

A Brief Overview of Gender Bias in AI

A brief overview and discussion on gender bias in AI | Continue reading


@thegradient.pub | 20 days ago

Mamba Explained

Is Attention all you need? Mamba, a novel AI model based on State Space Models (SSMs), emerges as a formidable alternative to the widely used Transformer models, addressing their inefficiency in processing long sequences. | Continue reading


@thegradient.pub | 1 month ago

Car-GPT: Could LLMs finally make self-driving cars happen?

Exploring the utility of large language models in autonomous driving: Can they be trusted for self-driving cars, and what are the key challenges? | Continue reading


@thegradient.pub | 1 month ago

Do text embeddings perfectly encode text?

'Vec2text' can serve as a solution for accurately reverting embeddings back into text, thus highlighting the urgent need for revisiting security protocols around embedded data. | Continue reading


@thegradient.pub | 1 month ago

Do text embeddings perfectly encode text?

'Vec2text' can serve as a solution for accurately reverting embeddings back into text, thus highlighting the urgent need for revisiting security protocols around embedded data. | Continue reading


@thegradient.pub | 1 month ago

Why Doesn’t My Model Work?

Have you ever trained a model you thought was good, but then it failed miserably when applied to real world data? If so, you’re in good company. | Continue reading


@thegradient.pub | 2 months ago

Deep learning for single-cell sequencing: a microscope to see the diversity of cells

On the the pivotal role that Deep Learning has played as a key enabler for advancing single-cell sequencing technologies. | Continue reading


@thegradient.pub | 3 months ago

Salmon in the Loop

On fish counting – a complex sociotechnical problem in a field that is going through the process of digital transformation. | Continue reading


@thegradient.pub | 4 months ago

Neural algorithmic reasoning

In this article, we will talk about classical computation: the kind of computation typically found in an undergraduate Computer Science course on Algorithms and Data Structures [1]. Think shortest path-finding, sorting, clever ways to break problems down into simpler problems, in … | Continue reading


@thegradient.pub | 6 months ago

The Artificiality of Alignment

This essay first appeared in Reboot. Credulous, breathless coverage of “AI existential risk” (abbreviated “x-risk”) has reached the mainstream. Who could have foreseen that the smallcaps onomatopoeia “ꜰᴏᴏᴍ” — both evocative of and directly derived from children’s cartoons — | Continue reading


@thegradient.pub | 6 months ago

An Introduction to the Problems of AI Consciousness

Once considered a forbidden topic in the AI community, discussions around the concept of AI consciousness are now taking center stage, marking a significant shift since the current AI resurgence began over a decade ago. | Continue reading


@thegradient.pub | 7 months ago

Text-to-CAD: Risks and Opportunities

In the realm of AI-powered text-to-CAD, there's promise, but also a surge in subpar designs. Can we steer this technology towards better outcomes? | Continue reading


@thegradient.pub | 7 months ago

Interpretability Creationism

On “interpretability creationism” – interpretability methods that only look at the final state of the model and ignore its evolution over the course of training | Continue reading


@thegradient.pub | 9 months ago

What Do LLMs Know About Linguistics? It Depends on How You Ask

On the phenomenon of LLM sensitivity to prompting choices through two core linguistic tasks and categorize how specific prompting choices can affect the model's behavior. | Continue reading


@thegradient.pub | 9 months ago

Why transformative artificial intelligence is really, really hard to achieve

A collection of the best technical, social, and economic arguments Humans have a good track record of innovation. The mechanization of agriculture, steam engines, electricity, modern medicine, computers, and the internet—these technologies radically changed the world. Still, the … | Continue reading


@thegradient.pub | 10 months ago

Modern AI is Domestification

On taming wild internet-scale data distributions to make base foundation models useful and performant for specific tasks. | Continue reading


@thegradient.pub | 11 months ago

Artificial Curiosity as Moral Virtue

A painter looks at her work of art and asks herself, “I’m not sure how good it is. Should I ask my colleagues? Or should I ask my computer?” The latter question, as absurd as it may seem, is not so out of the norm to | Continue reading


@thegradient.pub | 11 months ago

In-Context Learning, In Context

On the phenomenon of in-context learning in large language models and what researchers have learned about it so far. | Continue reading


@thegradient.pub | 1 year ago

Software²: A new generation of AIs that become increasingly general by producing their own training data

On a likely shift in research trends toward a stronger focus on data collection and generation as a principal means to improve model performance. | Continue reading


@thegradient.pub | 1 year ago

Grounding Large Language Models in a Cognitive Foundation: How to Build Someone We Can Talk To

Many intelligent robots have come and gone, failing to become a commercial success. We’ve lost Aibo, Romo, Jibo, Baxter—even Alexa is reducing staff. Perhaps they failed to reach their potential because you can’t have a meaningful conversation with them. We are now at an | Continue reading


@thegradient.pub | 1 year ago

Towards Geometric Deep Learning

Geometric Deep Learning is a term for approaches considering ML problems from the perspectives of symmetry and invariance. It provides a common blueprint for CNNs, GNNs, and Transformers. Here, we study the history of GDL from ancient Greek geometry to Graph Neural Networks. | Continue reading


@thegradient.pub | 1 year ago

Artists enable AI art - shouldn't they be compensated?

The debate around artist compensation in AI art, and some possible solutions to the problem | Continue reading


@thegradient.pub | 1 year ago

Do Large Language Models learn world models or just surface statistics?

A mysteryLarge Language Models (LLM) are on fire, capturing public attention by their ability to provide seemingly impressive completions to user prompts (NYT coverage). They are a delicate combination of a radically simplistic algorithm with massive amounts of data and computing … | Continue reading


@thegradient.pub | 1 year ago

Reasons to Punish Autonomous Robots

1. IntroductionDeploying autonomous robots in military contexts strikes many people as terrifying and morally odious. What lies behind those reactions? One thought is that if a sophisticated artificial intelligence were causally responsible for some harm, there will be no one to … | Continue reading


@thegradient.pub | 1 year ago

Learning to Make the Right Mistakes - a Brief Comparison Between Human Perception and Multimodal LMs

Intelligence is not always about being correct, it sometimes is about making the right mistakes based on one’s understanding of the world. | Continue reading


@thegradient.pub | 1 year ago

Artificial Curiosity as Moral Virtue

A painter looks at her work of art and asks herself, “I’m not sure how good it is. Should I ask my colleagues? Or should I ask my computer?” The latter question, as absurd as it may seem, is not so out of the norm to some. Given the | Continue reading


@thegradient.pub | 1 year ago

Artificial Intelligence and the Future of Demos

Artificial Intelligence (AI) has an increasing say in the range of opportunities we are offered in life. Artificial neural networks might be used in deciding whether you will get a loan, an apartment, or your next job based on datasets collected from around the globe. Generative … | Continue reading


@thegradient.pub | 1 year ago

Causal Inference: Connecting Data and Reality

Causation is everywhere in life. However, compared to other concepts such as statistical correlation, causality is very difficult to define. In this article, we explore statistical approaches to defining causality. | Continue reading


@thegradient.pub | 1 year ago

The Future of Speech Recognition: Where Will We Be in 2030?

The last two years have been some of the most exciting and highly anticipated in Automatic Speech Recognition’s (ASR’s) long and rich history, as we saw multiple enterprise-level fully neural network-based ASR models go to market (e.g. Alexa, Rev, AssemblyAI, ASAPP, etc). The acc … | Continue reading


@thegradient.pub | 1 year ago

Symmetries, Scaffolds, and a New Era of Scientific Discovery

[T]he field of geometric deep learning, a field that exploits the symmetries of machine learning models in non-Euclidean domains, is increasingly being used to speed up nearly every stage of the drug development pipeline. | Continue reading


@thegradient.pub | 1 year ago

Overview of Graph Theory and Alzheimer's Disease

The Roman physician Galen was among the first people to realize that the brain controlled motor responses, cognitive function, and memory. (Freemon 1994) But how does the brain control these processes? Ever since Galen, this question has propelled the field of neuroscience. Begin … | Continue reading


@thegradient.pub | 1 year ago

Lessons from the GPT-4Chan Controversy

What happened with GPT-4chan, aka 'the worst AI ever', and what can be learned from it | Continue reading


@thegradient.pub | 1 year ago

AI Is Ushering in a New Scientific Revolution

AI is ushering in a new scientific revolution by making remarkable breakthroughs in a number of fields, unlocking new approaches to science, and accelerating the pace of science and innovation. | Continue reading


@thegradient.pub | 1 year ago

Working on the Weekends – An Academic Necessity?

If there is one thing that seems more typical of academia than having a terrible work-life balance, it is complaining about the terrible work-life balance we have. So what do we do about it? | Continue reading


@thegradient.pub | 1 year ago

Lessons From Deploying Deep Learning To Production

Best practices for efficiently training and deploying production-level deep learning models. | Continue reading


@thegradient.pub | 1 year ago

An Illustrated Tour of Applying BERT to Speech Data

This is an updated version of a piece originally posted on the author’s blog.Released in 2018, BERT has quickly become one of the most popular models for natural language processing, with the original paper accumulating tens of thousands of citations. Its success recipe lies in a … | Continue reading


@thegradient.pub | 1 year ago

Beyond Message Passing, a Physics-Inspired Paradigm for Graph Neural Networks

On going beyond message-passing based graph neural networks with physics-inspired “continuous” learning models | Continue reading


@thegradient.pub | 1 year ago

Focus on the Process: Formulating AI Ethics Principles More Responsibly

Artificial Intelligence (AI) systems have been involved in numerous scandals in recent years. For instance, take the COMPAS recidivism algorithm. The algorithm evaluated the likelihood that defendants will commit another crime in the future. It was widely used in the US criminal … | Continue reading


@thegradient.pub | 1 year ago

Deep Learning in Neuroimaging

An introduction to unique aspects of neuroimaging data and how we can leverage these aspects with deep learning algorithms. | Continue reading


@thegradient.pub | 1 year ago

New Technology, Old Problems: The Missing Voices in Natural Language Processing

Who and what is being represented in data and development of NLP models, and how unequal representation leads to unequal allocation of the benefits of NLP technology | Continue reading


@thegradient.pub | 2 years ago

Reading the Tea Leaves: Expert End-Users Explaining the Unexplainable

On the ways in which domain experts adapt to new technologies that lack explainability | Continue reading


@thegradient.pub | 2 years ago

Bootstrapping Labels via ___ Supervision & Human-In-The-Loop

Most machine learning tutorials and papers assume the availability of training labels. This includes benchmark datasets such as OpenImages or SuperGLUE, or customer interaction behavior such as clicks or purchases. But what if labeled datasets are not available? We would have to … | Continue reading


@thegradient.pub | 2 years ago

One Voice Detector to Rule Them All

VAD is among the most important and fundamental algorithms in any production or data preparation pipelines related to speech | Continue reading


@thegradient.pub | 2 years ago

How Aristotle is fixing deep learning’s flaws

Who can deny the chilly breeze blowing through some quarters of the AI world? While many continue to bask in the glorious summertime ushered in by the ascendency of deep learning, some are sensing autumnal winds which carry with them cautionary words we have all heard many times, … | Continue reading


@thegradient.pub | 2 years ago

How AI is Changing Chemical Discovery

This article covers some of the more prominent usages of AI in chemistry, the parent discipline of the protein folding problem. | Continue reading


@thegradient.pub | 2 years ago

The Promise of Hierarchical Reinforcement Learning (2019)

This idea of temporal abstraction, once incorporated into reinforcement learning (RL), converts it into *hierarchical* reinforcement learning (HRL). | Continue reading


@thegradient.pub | 2 years ago

Engaging with Disengagement

How will traffic laws change as we slowly enter the autonomous vehicle era, and in general, the AI-driven 21st century? | Continue reading


@thegradient.pub | 2 years ago