Mejoras de IA y Accesibilidad para Android, Chrome

Hogar

Noticias

22 de mayo de 2025

JustinWilliams

# ai # Android # Chrome

While we celebrate Global Accessibility Awareness Day, we are excited to announce significant updates to our products on Android and Chrome, along with new resources for developers working on speech recognition tools. These advancements in AI are making our digital world more accessible and inclusive.

More AI-Powered Innovation with Android

We are taking our commitment to accessibility further by integrating Google AI and Gemini into the fabric of mobile experiences, particularly for vision and hearing.

Enhancing Details with Gemini and TalkBack

Last year, we integrated Gemini's capabilities into TalkBack, Android's screen reader, which provided AI-generated descriptions for images, even when alt text was missing. Now, we are expanding this feature. Users can now ask questions about images they receive, like a friend's new guitar. You can inquire about the make, color, or other elements within the photo. Additionally, you can get descriptions and ask questions about your entire screen. For example, while browsing for sales on a shopping app, you can ask Gemini about the material of an item or check for available discounts.

Use TalkBack's Gemini-powered capabilities to get a description of what's on your screen.

Understanding More of the Emotion Behind Captions

With Expressive Captions, your Android device now provides real-time captions across most apps, capturing not just what's said, but also how it's said. We've added a new duration feature that highlights when words are drawn out, like hearing an "amaaazing shot" in a sports broadcast or a drawn-out "nooooo" in a video message. You'll also get more labels for sounds, such as whistling or throat clearing. This update is available in English for devices running Android 15 and above in the U.S., U.K., Canada, and Australia.

With Expressive Captions' new duration feature, get even more context of what's being said in the audio and video on your phone.

Improving Speech Recognition Around the World

Since launching Project Euphonia in 2019, our aim has been to make speech recognition more accessible for those with non-standard speech patterns. We are now expanding support for developers and organizations worldwide, helping them adapt this technology to more languages and cultural contexts.

New Developer Resources

To foster a global ecosystem of accessible tools, we are offering developers access to our open-source repositories through Project Euphonia’s GitHub page. This allows them to develop personalized audio tools for research or train their models to recognize diverse speech patterns.

Support for New Projects in Africa

Earlier this year, we partnered with Google.org to support the University College London in establishing the Centre for Digital Language Inclusion (CDLI). The CDLI is focused on enhancing speech recognition technology for non-English speakers across Africa. They are creating open-source datasets in 10 African languages, developing new speech recognition models, and supporting the broader community of organizations and developers in this field.

Expanding Accessibility Options for Students

Accessibility tools play a crucial role for students with disabilities, from using facial gestures to navigate Chromebooks with Face Control to customizing their reading experience with Reading Mode. Now, when using Chromebooks with College Board's Bluebook testing app for SAT and Advanced Placement exams, students will have access to all of Google's built-in accessibility features, including ChromeVox screen reader and Dictation, along with College Board's own digital testing tools.

Making Chrome More Accessible

With over 2 billion daily users, we are constantly working to improve Chrome's accessibility. Features like Live Caption and image descriptions for screen reader users are part of this effort.

Accessing PDFs More Easily on Chrome

Previously, scanned PDFs were not accessible to screen readers in desktop Chrome. Now, with Optical Character Recognition (OCR), Chrome can recognize these PDFs, allowing you to highlight, copy, search for text, and use your screen reader to read them.

Reading with Ease with Page Zoom

Page Zoom on Chrome for Android now lets you increase text size without altering the webpage layout or your browsing experience, similar to how it works on Chrome desktop. You can set your zoom preferences to apply to all pages or specific ones.

Page Zoom works with Chrome on Android, letting you customize how you see pages.

To use this feature, simply tap the three-dot menu in the top right corner of Chrome and adjust your zoom settings.

Artículo relacionado

YouTube integra la herramienta de vídeo Veo 3 AI directamente en la plataforma Shorts YouTube Shorts incluirá el modelo de vídeo Veo 3 AI este veranoNeal Mohan, Consejero Delegado de YouTube, reveló durante su discurso en Cannes Lions que la tecnología de generación de vídeo Veo 3 AI d

Google Cloud impulsa grandes avances en la investigación y el descubrimiento científicos La revolución digital está transformando las metodologías científicas gracias a unas capacidades computacionales sin precedentes. Las tecnologías de vanguardia aumentan ahora tanto los marcos teóricos

La inteligencia artificial Grok de Elon Musk pide la opinión del propietario antes de realizar consultas complejas. El recientemente lanzado Grok AI -promocionado por Elon Musk como un sistema de "búsqueda máxima de la verdad"- ha llamado la atención por su tendencia a consultar las declaraciones públicas de Musk a

comentario (9)

0/200

Entregar

DonaldRoberts

26 de agosto de 2025 05:01:14 GMT+02:00

Wow, these AI updates for Android and Chrome sound like a game-changer for accessibility! I'm curious how the speech recognition tools will evolve—could they finally keep up with my fast-talking friends? 😄

OliverAnderson

22 de agosto de 2025 11:01:17 GMT+02:00

Wow, these AI updates for Android and Chrome sound like a game-changer for accessibility! 🥳 Curious how the speech recognition tools will evolve—hope they’re as intuitive as they claim!

AndrewAllen

11 de agosto de 2025 05:01:00 GMT+02:00

C'est génial de voir des progrès en accessibilité ! Les mises à jour pour Android et Chrome vont vraiment aider plus de monde à naviguer facilement. Mais, est-ce que ces outils seront assez intuitifs pour les novices ? 😊 J’espère qu’on verra plus d’innovations comme ça !

GaryPerez

31 de julio de 2025 03:41:20 GMT+02:00

Love how AI is making tech more inclusive! These Android and Chrome updates sound amazing—can't wait to see them in action. 🌟

EricAllen

24 de mayo de 2025 09:29:52 GMT+02:00

Super impressionnant, ces mises à jour pour l’accessibilité ! 🥳 L’IA qui rend le numérique plus inclusif, c’est génial. Mais j’espère que ça ne va pas trop compliquer les choses pour les non-techies.

RichardAdams

24 de mayo de 2025 08:31:39 GMT+02:00

Wow, these AI updates for Android and Chrome sound amazing! 😍 Making tech more inclusive is such a big win. Excited to see how these speech recognition tools evolve!

Noticias principales

Gemini 2.5 Pro ahora ilimitado y más barato que Claude, GPT-4O Generadores de Video AI Top de 2025: Pika Labs vs Alternativas Doblaje AI: Guía Definitiva para la Creación de Voz Realista La IA de Cambium transforma la madera de los desechos en madera Operai mejora el asistente de voz de IA para mejores chats Cómo garantizar que sus datos sean confiables para la integración de IA Notebooklm se expande a nivel mundial, agrega diapositivas y verificación de hechos mejorada Los ajustes a los centros de datos de EE. UU. Podrían desbloquear 76 GW de nueva capacidad de potencia Google utiliza IA para suspender más de 39 millones de cuentas publicitarias por sospecha de fraude Clonación de Voz IA: La guía definitiva para dominar la conversión de voz

Más

Presentado