January 24, 2025 report

Self-adaptive LLM dynamically adjusts its weights to learn new tasks

by Bob Yirka , Tech Xplore

Self-adaptive AI LLM—Transformer²—dynamically adjusts its weights to learn new tasks — Method overview. Left) At training time, we employ SVF and RL to learn the "expert" vectors z's that scale the singular values of the weight matrices. Right) At inference time, we propose three distinct methods to adaptively select/combine the learned expert vectors. Credit: *arXiv* (2025). DOI: 10.48550/arxiv.2501.06252

A trio of AI researchers at Sakana AI, a Japanese startup, has announced the development of a self-adaptive AI LLM called Transformer². Qi Sun, Edoardo Cetin, and Yujin Tang, have posted their paper on the arXiv preprint server.

As LLMs mature, AI researchers continue to refine them to be more efficient and less energy demanding. In this new study, the research trio has found a way to reduce one of the major inefficiencies in traditional LLMs—the need for fine-tuning if they are asked to do something they have not been trained to do.

Under current scenarios, an LLM's parameters are adjusted and it is then trained with new samples—afterward, the new parameters remain frozen in place. The research team has introduced a model that makes adjustments to a system of weights when it is introduced to something new, to allow it to adjust dynamically to new types of tasks.

To allow the LLM to carry out dynamic adjustments, the researchers have split the task response into a two-step approach; the first involves analyzing the request and figuring out what will be required to provide a good response. The second involves making adjustments to a system of weights to help it focus its efforts on things that will lead to an answer.

The system of weights uses a math process called Singular Value Decomposition to determine which parts of its own AI system are the most important for providing the best possible answer. Reinforcement learning is applied to create the steps needed to guide the AI's behavior.

During inference, (which is the part of the system involved in generating responses to the initial query), the system employs three main strategies to achieve its goals—one that is based on the prompt, another that serves as a classifier and the third that applies a few-shot adaptation process (where an AI model learns from a limited training set). Once the weights have been applied, the LLM carries on in similar fashion to other LLMs.

The overall result of using the new approach is that it allows an LLM to adjust itself on the fly when it finds itself faced with an unfamiliar task. Testing of the system showed it capable of performing as well as other LLMs on traditional queries but much more flexible when it came to answering queries that confused other models.

More information: Qi Sun et al, Transformer²: Self-adaptive LLMs, arXiv (2025). DOI: 10.48550/arxiv.2501.06252

Journal information: arXiv

Citation: Self-adaptive LLM dynamically adjusts its weights to learn new tasks (2025, January 24) retrieved 26 January 2025 from https://techxplore.com/news/2025-01-llm-dynamically-adjusts-weights-tasks.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Self-adaptive LLM dynamically adjusts its weights to learn new tasks

Eco-friendly aluminum battery lasts 10,000 cycles with minimal loss

Neural networks model improves machine vision and object detection under low-light conditions

New research shows many UK homes can adopt heat pumps with minimal upgrades

Bioinspired 3D printing: Architected design creates efficient structures

Butterfly-inspired method for robot wing movement works without electronics or batteries

Scaling up neuromorphic computing for more efficient and effective AI everywhere and anytime

How good old mud can lower building costs

Electric vehicles now match traditional cars for longevity, study finds

OpenAI unveils 'Operator' agent that handles web tasks

Chatbot offers empathetic, multilingual crime reporting to ease dispatcher workload

LlamaV-o1: Curriculum learning–based LLM shows benefits of step-by-step reasoning in AI systems

Software engineers develop a way to run AI language models without matrix multiplication

New method allows AI to learn indefinitely

AI models tested for privacy-safe radiology report analysis

Teaching LLMs how to know when to ask for help to provide more accurate answers

Test of 'poisoned dataset' shows vulnerability of LLMs to medical misinformation

Neural networks model improves machine vision and object detection under low-light conditions

Scaling up neuromorphic computing for more efficient and effective AI everywhere and anytime

OpenAI unveils 'Operator' agent that handles web tasks

Chatbot offers empathetic, multilingual crime reporting to ease dispatcher workload

Embodied AI reveals how robots and toddlers learn to understand

People overestimate reliability of AI-assisted language tools: Adding uncertainty phrasing can help

Phys.org

Medical Xpress

Science X

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Self-adaptive LLM dynamically adjusts its weights to learn new tasks

Eco-friendly aluminum battery lasts 10,000 cycles with minimal loss

Neural networks model improves machine vision and object detection under low-light conditions

New research shows many UK homes can adopt heat pumps with minimal upgrades

Bioinspired 3D printing: Architected design creates efficient structures

Butterfly-inspired method for robot wing movement works without electronics or batteries

Scaling up neuromorphic computing for more efficient and effective AI everywhere and anytime

How good old mud can lower building costs

Electric vehicles now match traditional cars for longevity, study finds

OpenAI unveils 'Operator' agent that handles web tasks

Chatbot offers empathetic, multilingual crime reporting to ease dispatcher workload

Related Stories

LlamaV-o1: Curriculum learning–based LLM shows benefits of step-by-step reasoning in AI systems

Software engineers develop a way to run AI language models without matrix multiplication

New method allows AI to learn indefinitely

AI models tested for privacy-safe radiology report analysis

Teaching LLMs how to know when to ask for help to provide more accurate answers

Test of 'poisoned dataset' shows vulnerability of LLMs to medical misinformation

Recommended for you

Neural networks model improves machine vision and object detection under low-light conditions

Scaling up neuromorphic computing for more efficient and effective AI everywhere and anytime

OpenAI unveils 'Operator' agent that handles web tasks

Chatbot offers empathetic, multilingual crime reporting to ease dispatcher workload

Embodied AI reveals how robots and toddlers learn to understand

People overestimate reliability of AI-assisted language tools: Adding uncertainty phrasing can help

Your Privacy

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.