Dnotitia Unveils STAR-KV, Achieving Up to 20x KV Cache Compression, Selected as an ICML 2026 Spotlight Paper

by Asia Today Team
July 2, 2026

  • Introduces a low-rank approach to KV cache compression, one of the key bottlenecks in long-context AI
  • Speeds up attention computation by up to 6.9x and overall generation throughput by up to 3.1x, moving beyond memory savings to faster inference
  • Selected as a Spotlight paper at ICML 2026, representing about 2.2% of reviewed submissions and about 8.4% of accepted papers
  • Following the attention around Google’s TurboQuant at ICLR 2026, STAR-KV presents another approach to advancing KV cache compression
  • Paper available on arXiv; source code released on GitHub

SEOUL, South Korea, July 2, 2026 /PRNewswire/ -- Dnotitia Inc. (Dnotitia), a company specializing in long-term memory AI and semiconductor-based AI infrastructure technologies, has released the paper and source code for “STAR-KV: Low-Rank KV Cache Compression through Smooth Thresholding for Adaptive Rank Management.” The technology was developed through a joint research effort involving UC San Diego’s VVIP Lab and Dnotitia researchers, and the paper was selected as a Spotlight paper at ICML 2026 (International Conference on Machine Learning 2026), one of the world’s leading conferences in machine learning.

Dnotitia contributed STAR-KV, selected as an ICML 2026 Spotlight paper, achieving up to 20x KV cache compression and faster inference through low-rank compression and GPU optimization

In the experiments reported in the paper, low-rank compression alone reduced the KV cache by up to 75%. Combined with the mixed-precision quantization method proposed in the paper, STAR-KV compressed the total KV cache by up to 20x. The technology also improves computation speed through custom GPU kernels, increasing attention computation speed by up to 6.9x and overall generation throughput by up to 3.1x. STAR-KV also showed higher accuracy than leading existing KV cache compression methods.
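To see how the 75% and 20x figures could relate, the short calculation below converts the percentage into a ratio and works out what quantization would need to contribute. It is a rough sketch only: it assumes the two factors multiply, that the uncompressed cache is stored in 16 bits, and that both "up to" figures come from the same setting, none of which the release states.

```python
# Back-of-the-envelope check of the reported compression figures.
# Assumptions (not from the release): the low-rank and quantization factors
# multiply, and the uncompressed cache uses 16 bits per stored value.

def ratio_from_reduction(reduction: float) -> float:
    """Convert a fractional size reduction (0.75 = 75% smaller) to an 'Nx' ratio."""
    return 1.0 / (1.0 - reduction)

low_rank_ratio = ratio_from_reduction(0.75)   # 75% smaller is a 4x ratio
total_ratio = 20.0                            # headline figure from the release
quant_ratio = total_ratio / low_rank_ratio    # factor left for quantization
avg_bits = 16.0 / quant_ratio                 # implied average bit width

print(f"low-rank alone: {low_rank_ratio:.1f}x")
print(f"needed from quantization: {quant_ratio:.1f}x")
print(f"implied average precision: {avg_bits:.1f} bits per stored value")
```

Under these assumptions, a 4x low-rank reduction would need roughly a further 5x from quantization, or about 3.2 bits per stored value on average, which is in line with what a mixed-precision scheme might plausibly use.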

KV cache compression has become a key technical challenge in AI infrastructure. As research into reducing the memory bottleneck of long-context AI gains momentum, including the attention around Google’s TurboQuant at ICLR 2026, STAR-KV presents a new approach that combines low-rank compression with quantization and GPU execution optimization.
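The general idea of low-rank compression can be illustrated with a truncated SVD in NumPy. This is a generic sketch and not the STAR-KV algorithm: the fixed rank, the synthetic key matrix, and the helper name `low_rank_compress` are all assumptions made for illustration, and the paper's adaptive rank selection and quantization steps are not reproduced here.

```python
import numpy as np

def low_rank_compress(K: np.ndarray, rank: int):
    """Factor K (tokens x dim) into A (tokens x rank) and B (rank x dim)."""
    U, S, Vt = np.linalg.svd(K, full_matrices=False)
    A = U[:, :rank] * S[:rank]   # scale the kept columns by their singular values
    B = Vt[:rank, :]
    return A, B

rng = np.random.default_rng(0)
tokens, dim, true_rank = 512, 128, 16

# Synthetic "keys" that are approximately low-rank, plus a little noise.
K = rng.standard_normal((tokens, true_rank)) @ rng.standard_normal((true_rank, dim))
K += 0.01 * rng.standard_normal((tokens, dim))

A, B = low_rank_compress(K, rank=32)
rel_err = np.linalg.norm(K - A @ B) / np.linalg.norm(K)
stored = A.size + B.size

print(f"relative reconstruction error: {rel_err:.4f}")
print(f"stored values: {stored} vs {K.size} ({K.size / stored:.2f}x smaller)")
```

The saving comes from storing two thin factors instead of the full matrix. How much rank can be dropped without hurting accuracy depends on the data, which is why an adaptive choice of rank is attractive.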

The KV cache is temporary memory stored on the GPU so that a large language model (LLM) does not have to recompute context it has already processed. As AI evolves into agentic systems that use multiple documents, conversation history, code, search results, and outputs from external tools, the amount of context a model must process is growing rapidly. In this environment, the KV cache has emerged as a key bottleneck affecting both GPU memory usage and inference cost.
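A toy single-head attention loop shows what the cache saves. The sketch below is illustrative only: the dimensions, random weights, and the `attend` helper are assumptions, and real models add multiple heads, layers, and positional encodings on top of this.

```python
import numpy as np

def attend(q, K, V):
    """Single-head scaled dot-product attention for one query vector."""
    scores = K @ q / np.sqrt(q.shape[0])
    weights = np.exp(scores - scores.max())   # numerically stable softmax
    weights /= weights.sum()
    return weights @ V

rng = np.random.default_rng(0)
d = 16
Wq, Wk, Wv = (rng.standard_normal((d, d)) for _ in range(3))
xs = rng.standard_normal((6, d))              # six token embeddings

# With a cache: each step projects only the new token and appends it.
k_cache, v_cache, cached_out = [], [], []
for x in xs:
    k_cache.append(x @ Wk)
    v_cache.append(x @ Wv)
    cached_out.append(attend(x @ Wq, np.array(k_cache), np.array(v_cache)))

# Without a cache: every step re-projects the whole prefix.
recomputed_out = [
    attend(xs[t] @ Wq, xs[: t + 1] @ Wk, xs[: t + 1] @ Wv)
    for t in range(len(xs))
]

print(np.allclose(cached_out, recomputed_out))
```

Both paths produce the same outputs, but the cached path trades repeated projection work for memory that grows by one key and one value vector per token, per head, per layer. That growth is what makes the cache a bottleneck at long context lengths.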

According to the STAR-KV paper, when a LLaMA-3.1-8B model processes a 128K-token context at a batch size of 4, the KV cache accounts for about 81% of total GPU memory. As long-context AI becomes more widely used, KV cache compression is increasingly seen as a core AI infrastructure technology for processing long context at lower cost.
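The order of magnitude of that figure can be checked with a short estimate. The model shape below (32 layers, 8 KV heads, head dimension 128, roughly 8.03 billion parameters) is taken from the publicly available LLaMA-3.1-8B configuration rather than from the release, and the calculation assumes 16-bit weights and cache while ignoring activations and workspace memory, so the paper's own accounting may differ.

```python
# Assumed LLaMA-3.1-8B shape from the public model config (not stated in the release).
layers, kv_heads, head_dim = 32, 8, 128
bytes_per_value = 2        # assumption: 16-bit cache and weights
params = 8.03e9            # assumption: approximate parameter count

tokens, batch = 128 * 1024, 4

# Keys and values (the factor of 2) for every layer, KV head, and head dimension.
kv_bytes_per_token = 2 * layers * kv_heads * head_dim * bytes_per_value
kv_bytes = kv_bytes_per_token * tokens * batch
weight_bytes = params * bytes_per_value

share = kv_bytes / (kv_bytes + weight_bytes)

print(f"KV cache per token: {kv_bytes_per_token / 1024:.0f} KiB")
print(f"KV cache total:     {kv_bytes / 2**30:.0f} GiB")
print(f"share of KV cache + weights: {share:.0%}")
```

Under these assumptions the cache comes to about 64 GiB against roughly 16 GB of weights, a share close to the 81% quoted above.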

ICML, where the STAR-KV paper was accepted, is widely regarded as one of the top international conferences in AI and machine learning, alongside NeurIPS and ICLR. ICML 2026 will be held from July 6 to 11 at COEX in Seoul. This year, 23,918 papers entered review, 6,352 were accepted, and 536 were selected as Spotlight papers. Spotlight papers account for about 2.2% of all reviewed submissions and about 8.4% of accepted papers.
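The quoted shares follow directly from the three counts, as the short check below shows. The overall acceptance rate is not stated in the release and is derived here from the same numbers.

```python
reviewed, accepted, spotlight = 23_918, 6_352, 536

print(f"spotlight / reviewed: {spotlight / reviewed:.1%}")
print(f"spotlight / accepted: {spotlight / accepted:.1%}")
print(f"accepted / reviewed:  {accepted / reviewed:.1%}")
```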

Going forward, Dnotitia plans to further advance STAR-KV for use in real-world AI service environments and explore its application to open-source LLM inference frameworks such as vLLM.

“Technologies that help AI process longer context faster and at lower cost are advancing rapidly,” said MK Chung, CEO of Dnotitia. “STAR-KV addresses the core bottlenecks in KV cache capacity and attention processing speed, and Dnotitia aims to contribute to the AI inference ecosystem through open sourcing.”

Asia Today

Copyright © 2022 Asia Today.