Thursday, Nov 14, 2024

New Apple breakthrough makes Apple GPT on next year’s iPhone even more exciting

Apple GPT might soon become a reality. During the past few months, we heard several reports about this learning language model is working on. For example, The Information posted that Apple is spending millions of dollars daily to train its LLM.

While the publication says most of this investment would focus on AppleCare customers, the Siri team plans to incorporate these language models to make complex shortcut integrations more accessible. In addition, Haitong International Securities analyst Jeff Pu has reported that Apple has been building a few hundred AI servers throughout 2023 and plans to add more in 2024.

He believes that Apple plans to combine cloud-based AI and on-device data processing to release its generative AI to iPhone and iPad users by late 2024, during the iOS 18 cycle. Since we're all looking forward to this Apple GPT technology to land on our iPhones, one small detail would set this GPT apart from the others: on-device usage instead of cloud-based.

While Pu believes Apple will mix both, the company is a big advocate of privacy as a "fundamental human right," so mainly relying on on-device processing would be a key differentiator from all the other companies. But since Large Language Models are... large, it means an iPhone technically wouldn't be able to run this future Apple GPT locally because it would need a proper server to do that.

That said, some Apple researchers published a paper showing how they could efficiently use Large Language Models with limited memory, which is very exciting.

In this paper, first spotted by MacRumors, the researchers say that the "method involves constructing an inference cost model that harmonizes with the flash memory behavior, guiding us to optimize in two critical areas: reducing the volume of data transferred from flash and reading data in larger, more contiguous chunks." By doing that, the company plans to use two new technologies:

  • Windowing: It loads parameters for only the past few tokens, reusing activations from recently computed tokens. This sliding windows approach reduces the number of IO requests to load weights.
  • Row-column bundling: It stores a concatenated row and column of the up-projection and down-projection layers to read bigger contiguous chunks from flash memory. This increases throughput by reading larger chunks.

The combination of methods could bring a 4-5 times increase in speed on CPUs and 20-25 times faster GPUS, which would allow AI models to run up to twice the size of the iPhone's memory. At the end of the day, this technology could improve Siri's capabilities, real-time translation, and other AI features for photos, videos, and understanding of how customers use their iPhones.

Don't Miss: Apple’s ChatGPT AI rival rumored to hit iPhone next year in iOS 18

The post New Apple breakthrough makes Apple GPT on next year’s iPhone even more exciting appeared first on BGR.

Today's Top Deals

  1. Today’s deals: $329 Apple Watch Series 9, $20 AirTags, $99 Bose speaker, ASUS laptops, Ring Indoor Cam, more
  2. M2 MacBook Air 15-inch hits new all-time low of $999, a $300 discount
  3. Amazon gift card deals, offers & coupons 2023: Get $390+ free

Trending Right Now:

  1. Apple TV+ has 4 of the most popular shows on any streamer right now
  2. Star Wars release dates: Every announced movie and TV show
  3. The Regime: HBO’s upcoming political satire starring Kate Winslet releases its first trailer
------------
Read More
By: José Adorno
Title: New Apple breakthrough makes Apple GPT on next year’s iPhone even more exciting
Sourced From: bgr.com/tech/new-apple-breakthrough-makes-apple-gpt-on-next-years-iphone-even-more-exciting/
Published Date: Thu, 21 Dec 2023 13:28:01 +0000

Did you miss our previous article...
https://trendinginbusiness.business/technology/beeper-throws-in-the-towel-and-abandons-imessage-support-on-android