Skip to content

Document support for STT in Google Generative AI #39721

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 3 commits into from
Jul 17, 2025
Merged

Conversation

tronikos
Copy link
Member

@tronikos tronikos commented Jun 26, 2025

Proposed change

Document support for STT in Google Generative AI

Type of change

  • Spelling, grammar or other readability improvements (current branch).
  • Adjusted missing or incorrect information in the current documentation (current branch).
  • Added documentation for a new integration I'm adding to Home Assistant (next branch).
  • Added documentation for a new feature I'm adding to Home Assistant (next branch).
  • Removed stale or deprecated documentation.

Additional information

  • Link to parent pull request in the codebase: Add Google AI STT core#147563
  • Link to parent pull request in the Brands repository:
  • This PR fixes or closes issue: fixes #

Checklist

  • This PR uses the correct branch, based on one of the following:
    • I made a change to the existing documentation and used the current branch.
    • I made a change that is related to an upcoming version of Home Assistant and used the next branch.
  • The documentation follows the Home Assistant documentation standards.

Summary by CodeRabbit

  • Documentation
    • Updated integration documentation to include speech-to-text capabilities for Google Generative AI.
    • Clarified that the integration now supports conversation agent, speech-to-text, and text-to-speech entities.
    • Reordered feature categories for improved clarity.

@home-assistant home-assistant bot added has-parent This PR has a parent PR in another repo next This PR goes into the next branch labels Jun 26, 2025
Copy link

netlify bot commented Jun 26, 2025

Deploy Preview for home-assistant-docs ready!

Name Link
🔨 Latest commit 9c09cf5
🔍 Latest deploy log https://app.netlify.com/projects/home-assistant-docs/deploys/6878ce9ad423a2000881c1d8
😎 Deploy Preview https://deploy-preview-39721--home-assistant-docs.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

Copy link
Contributor

coderabbitai bot commented Jun 26, 2025

📝 Walkthrough

Walkthrough

The documentation for the Google Generative AI integration was updated to include information about new speech-to-text capabilities. The order of categories was adjusted, the supported platforms list was expanded to include speech-to-text, and descriptive text was revised to mention the new functionality.

Changes

File(s) Change Summary
source/_integrations/google_generative_ai_conversation.markdown Updated documentation to add speech-to-text support, reordered categories, and revised description.

Sequence Diagram(s)

sequenceDiagram
    participant User
    participant HomeAssistant
    participant GoogleGenerativeAI

    User ->> HomeAssistant: Speak or type message
    HomeAssistant ->> GoogleGenerativeAI: Send audio/text for processing
    GoogleGenerativeAI -->> HomeAssistant: Return transcribed text (STT) or response
    HomeAssistant -->> User: Provide response as text or speech
Loading

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share
🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

  • Review comments: Directly reply to a review comment made by CodeRabbit. Example:
    • I pushed a fix in commit <commit_id>, please review it.
    • Explain this complex logic.
    • Open a follow-up GitHub issue for this discussion.
  • Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
    • @coderabbitai explain this code block.
    • @coderabbitai modularize this function.
  • PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
    • @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
    • @coderabbitai read src/utils.ts and explain its main purpose.
    • @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
    • @coderabbitai help me debug CodeRabbit configuration file.

Support

Need help? Create a ticket on our support page for assistance with any issues or questions.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

  • @coderabbitai pause to pause the reviews on a PR.
  • @coderabbitai resume to resume the paused reviews.
  • @coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
  • @coderabbitai full review to do a full review from scratch and review all the files again.
  • @coderabbitai summary to regenerate the summary of the PR.
  • @coderabbitai generate docstrings to generate docstrings for this PR.
  • @coderabbitai generate sequence diagram to generate a sequence diagram of the changes in this PR.
  • @coderabbitai resolve resolve all the CodeRabbit review comments.
  • @coderabbitai configuration to show the current CodeRabbit configuration for the repository.
  • @coderabbitai help to get help.

Other keywords and placeholders

  • Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
  • Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
  • Add @coderabbitai or @coderabbitai title anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (.coderabbit.yaml)

  • You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
  • Please see the configuration documentation for more information.
  • If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

  • Visit our Documentation for detailed information on how to use CodeRabbit.
  • Join our Discord Community to get help, request features, and share feedback.
  • Follow us on X/Twitter for updates and announcements.

Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 0

🧹 Nitpick comments (2)
source/_integrations/google_generative_ai_conversation.markdown (2)

16-20: Add a short STT-specific section or note
stt has been correctly appended to ha_platforms, but the page still lacks any STT-specific heading or usage details (similar to the existing TTS and Actions sections). A brief subsection (e.g. “### Speech-to-text”) outlining entity creation, languages, and an example stt.transcribe service call would make the docs more complete.


32-32: Minor grammar / parallelism tweak
The list mixes a single entity (“conversation agent”) with plural “entities”. Consider aligning wording for clarity:

-The Google Generative AI integration adds a conversation agent, speech-to-text, and text-to-speech entities powered by [Google Generative AI](https://ai.google.dev/) to Home Assistant.
+The Google Generative AI integration adds a conversation agent, as well as speech-to-text and text-to-speech entities powered by [Google Generative AI](https://ai.google.dev/) to Home Assistant.
📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro
Cache: Disabled due to data retention organization setting
Knowledge Base: Disabled due to data retention organization setting

📥 Commits

Reviewing files that changed from the base of the PR and between b9549bf and f05a4c0.

📒 Files selected for processing (1)
  • source/_integrations/google_generative_ai_conversation.markdown (3 hunks)
🧰 Additional context used
📓 Path-based instructions (1)
`source/_integrations/*`: Integration documentation must use the provided template structure and section order.

source/_integrations/*: Integration documentation must use the provided template structure and section order.

📄 Source: CodeRabbit Inference Engine (.github/copilot-instructions.md)

List of files the instruction was applied to:

  • source/_integrations/google_generative_ai_conversation.markdown
🪛 LanguageTool
source/_integrations/google_generative_ai_conversation.markdown

[uncategorized] ~32-~32: Possible missing article found.
Context: ...nt can optionally be allowed to control Home Assistant. Controlling Home Assistant ...

(AI_HYDRA_LEO_MISSING_THE)

🔇 Additional comments (1)
source/_integrations/google_generative_ai_conversation.markdown (1)

5-7: Order looks fine – categories are now alphabetically sorted
Placing “Speech-to-text” before “Text-to-speech” and “Voice” keeps the list alphabetised and matches the usual pattern used across other integration docs.

@frenck frenck added this to the 2025.7 milestone Jun 26, 2025
c0ffeeca7
c0ffeeca7 previously approved these changes Jun 26, 2025
Copy link
Contributor

@c0ffeeca7 c0ffeeca7 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you, @tronikos 👍

@c0ffeeca7 c0ffeeca7 added the awaits-parent Awaits the merge of an parent PR label Jun 26, 2025
@tronikos tronikos removed this from the 2025.7 milestone Jul 1, 2025
@home-assistant home-assistant bot added the parent-merged The parent PR has been merged already label Jul 16, 2025
@silamon silamon enabled auto-merge (squash) July 16, 2025 15:36
@c0ffeeca7 c0ffeeca7 closed this Jul 16, 2025
auto-merge was automatically disabled July 16, 2025 15:39

Pull request was closed

@home-assistant home-assistant bot removed parent-merged The parent PR has been merged already awaits-parent Awaits the merge of an parent PR labels Jul 16, 2025
@c0ffeeca7 c0ffeeca7 reopened this Jul 16, 2025
@c0ffeeca7
Copy link
Contributor

closed and reopened to kick netlify

@silamon silamon enabled auto-merge (squash) July 17, 2025 10:21
@silamon silamon merged commit 9ba8e45 into next Jul 17, 2025
9 checks passed
@silamon silamon deleted the tronikos-patch-1 branch July 17, 2025 10:24
@github-actions github-actions bot locked and limited conversation to collaborators Jul 18, 2025
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
has-parent This PR has a parent PR in another repo next This PR goes into the next branch
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants
pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy