Gemma 4 Models Harness Novel Training to Shrink On-Device Memory Footprint

Google's Gemma 4 models are making strides in efficiency, introducing a cutting-edge training trick that drastically cuts down their memory footprint on devices. This advancement is particularly significant for applications running on edge devices and smartphones, where computational resources and memory are often limited.

Affiliate content

Instant Gaming

Games up to -90% off

Instant key delivery on Instant Gaming

Browse deals →

The core of this innovation lies in a technique known as 'quantization-aware training.' Historically, machine learning models were trained using high-precision data (e.g., 32-bit floating point numbers) and then compressed, or 'quantized,' to lower precision (e.g., 8-bit integers) for deployment on devices. The challenge with this traditional approach was that the post-training quantization often led to a noticeable drop in model accuracy and performance.

With quantization-aware training, the models are trained with the quantization process already in mind. This means the neural network learns to compensate for the effects of lower precision during its training phase, ensuring that when it's eventually quantized for on-device deployment, it retains nearly identical performance to its high-precision counterpart. This results in smaller model sizes, faster inference times, and lower power consumption without sacrificing accuracy. For developers, this means the ability to run more sophisticated AI on less powerful hardware, opening up new possibilities for on-device AI applications and enhancing user privacy by processing data locally.

Recommended

Android Authority1 d ago

OnePlus's US Retreat: A Worrisome Trend for Android's Future

The reported departure of OnePlus from the US smartphone market signals a concerning shift within the Android ecosystem. This move could diminish competition and innovation, potentially leaving consumers with fewer choices and a less diverse field of devices.

Read article

Android Authority1 d ago

Samsung Prioritizes Wider Foldables with Galaxy Z Fold 8 Strategy

Samsung anticipates that the forthcoming Galaxy Z Fold 8, featuring a wider design, will outpace the sales of its premium Z Fold 8 Ultra model. This strategic move indicates a shift in design philosophy, aiming for broader consumer appeal with a more practical form factor.

Read article

Android Authority1 d ago

Google Alerts Android Users About Imminent Cloud Backup Policy Changes

Google is notifying Android users about significant upcoming alterations to its cloud backup storage policies, providing a 45-day heads-up before the new terms take effect. These changes could impact how much data users can store and how their backups are managed, requiring attention from Android device owners.

Read article

Android Authority1 d ago

Unveiling the Core Processing Power of Latest Game Boy-Style Handhelds

Details have emerged regarding the internal hardware powering a new wave of retro-inspired handheld gaming devices, reminiscent of the classic Game Boy. While the standard Air Y model might not impress with its raw power, its appeal could lie in an attractive price point, targeting the nostalgia market effectively.

Read article