Add API request for generate audio using Microsoft Edge's online text-to-speech service #31

TheMhv · 2024-09-21T13:54:32Z

Summary by CodeRabbit

New Features
- Introduced text-to-speech (TTS) functionality with new endpoints for TTS requests and voice retrieval.
- Added a new endpoint to retrieve user information.
Dependencies
- Integrated new dependencies: fastapi, pydantic, python-multipart, and edge-tts for enhanced application capabilities.

coderabbitai · 2024-09-21T13:54:38Z

Walkthrough

The changes involve updates to the pyproject.toml and requirements.txt files to include new dependencies: fastapi, pydantic, python-multipart, and edge-tts. Additionally, the rvc_python/api.py file has been modified to introduce new text-to-speech functionality, including a data model for TTS requests and two new endpoints for listing available voices and processing TTS requests. Error handling has been implemented for the new features.

Changes

File	Change Summary
pyproject.toml, requirements.txt	New dependencies added: `fastapi`, `pydantic`, `python-multipart`, and `edge-tts`.
rvc_python/api.py	New data model `TTSRequest` created. Added endpoints: `GET /voices` for listing voices and `POST /tts` for processing TTS requests. New method `get_user_info` also added. Error handling implemented.

Sequence Diagram(s)

sequenceDiagram
    participant Client
    participant API
    participant TTSService

    Client->>API: GET /voices
    API->>TTSService: List available voices
    TTSService-->>API: Return voices list
    API-->>Client: Return voices list

    Client->>API: POST /tts with TTSRequest
    API->>TTSService: Process TTS request
    TTSService-->>API: Generate audio
    API-->>Client: Return audio in WAV format

🐰 In the code, we hop and play,
New features brighten up the day!
With voices sweet and TTS bright,
Our app now sings with pure delight!
Dependencies added, oh what fun,
A joyful leap, the work is done! 🎶

Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media?

Share

Tips

Chat

There are 3 ways to chat with CodeRabbit:

Review comments: Directly reply to a review comment made by CodeRabbit. Example:
-- I pushed a fix in commit <commit_id>, please review it.
-- Generate unit testing code for this file.
- Open a follow-up GitHub issue for this discussion.
Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
-- @coderabbitai generate unit testing code for this file.
-- @coderabbitai modularize this function.
PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
-- @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
-- @coderabbitai read src/utils.ts and generate unit testing code.
-- @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
-- @coderabbitai help me debug CodeRabbit configuration file.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

@coderabbitai pause to pause the reviews on a PR.
@coderabbitai resume to resume the paused reviews.
@coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
@coderabbitai full review to do a full review from scratch and review all the files again.
@coderabbitai summary to regenerate the summary of the PR.
@coderabbitai resolve resolve all the CodeRabbit review comments.
@coderabbitai configuration to show the current CodeRabbit configuration for the repository.
@coderabbitai help to get help.

Other keywords and placeholders

Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
Add @coderabbitai anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (`.coderabbit.yaml`)

You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
Please see the configuration documentation for more information.
If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

Visit our Documentation for detailed information on how to use CodeRabbit.
Join our Discord Community to get help, request features, and share feedback.
Follow us on X/Twitter for updates and announcements.

coderabbitai

Actionable comments posted: 5

Outside diff range and nitpick comments (1)

pyproject.toml (1)
39-39: Consider specifying the version of the dependency.

To ensure compatibility in the future, it's a good practice to specify the version of the dependency. For example:
-  "edge-tts",
+  "edge-tts==1.0.0",

Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

Commits

Files that changed from the base of the PR and between ff061a0 and 593eebb.

Files selected for processing (3)

pyproject.toml (1 hunks)
requirements.txt (1 hunks)
rvc_python/api.py (3 hunks)

Additional context used

Ruff

rvc_python/api.py

159-159: Within an except clause, raise exceptions with raise ... from err or raise ... from None to distinguish them from errors in exception handling

(B904)

Additional comments not posted (3)

requirements.txt (2)

18-18: LGTM!

The addition of the python-multipart dependency aligns with the PR objectives. It is a commonly used library for handling multipart form data in web applications and APIs, particularly for file uploads.

Given that the PR involves an API request for generating audio, it is reasonable to assume that this dependency will be used to send audio files or data to the API.

19-19: Looks good!

The edge-tts dependency is a great fit for the PR objectives. It provides a Python interface to interact with Microsoft Edge's online text-to-speech service, which is exactly what the PR aims to achieve.

By adding this dependency, the project gains the ability to convert text to speech using the various voices provided by the Edge TTS service. This aligns perfectly with the goal of generating audio using Microsoft's service.

pyproject.toml (1)

39-39: LGTM!

The addition of the edge-tts dependency is consistent with the PR objectives and is necessary to support the new feature.

coderabbitai · 2024-09-21T13:57:06Z

rvc_python/api.py


+class TTSRequest(BaseModel):
+    text: str
+    voice: str | None = "Microsoft Server Speech Text to "


Complete the default value for voice in TTSRequest

The default value for voice appears incomplete: "Microsoft Server Speech Text to ". Please provide a complete and valid default voice identifier to ensure the TTS service functions correctly.

Apply this diff to fix the default voice value:

- voice: str | None = "Microsoft Server Speech Text to " + voice: str | None = "en-US-AriaNeural"

Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

voice: str | None = "Microsoft Server Speech Text to "

voice: str | None = "en-US-AriaNeural"

coderabbitai · 2024-09-21T13:57:06Z

rvc_python/api.py

+    @app.get("/voices")
+    async def list_voices():
+        return JSONResponse(content={"voices": await edge_tts.list_voices()})


Add exception handling to the /voices endpoint

Currently, the list_voices endpoint does not handle exceptions that may occur when calling edge_tts.list_voices(). This could lead to unhandled exceptions and a 500 Internal Server Error response without meaningful information to the client. Consider adding error handling to provide a clear error message.

Apply this diff to incorporate exception handling:

@app.get("/voices") async def list_voices(): + try: voices = await edge_tts.list_voices() return JSONResponse(content={"voices": voices}) + except Exception as e: + logger.error(f"Error retrieving voices: {e}") + raise HTTPException(status_code=500, detail="Failed to retrieve voices.") from e

Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

@app.get("/voices")

async def list_voices():

return JSONResponse(content={"voices": await edge_tts.list_voices()})

@app.get("/voices")

async def list_voices():

try:

voices = await edge_tts.list_voices()

return JSONResponse(content={"voices": voices})

except Exception as e:

logger.error(f"Error retrieving voices: {e}")

raise HTTPException(status_code=500, detail="Failed to retrieve voices.") from e

coderabbitai · 2024-09-21T13:57:06Z

rvc_python/api.py

+            )
+            await communicate.save(input_path)
+
+            app.state.rvc.infer_file(input_path, output_path)


Avoid blocking the event loop with synchronous infer_file method

The app.state.rvc.infer_file method appears to be a synchronous and potentially long-running operation. Executing it directly in an async endpoint could block the event loop, affecting the application's performance. Consider running it in a separate thread using asyncio.to_thread.

Apply this diff to run infer_file without blocking:

- app.state.rvc.infer_file(input_path, output_path) + await asyncio.to_thread(app.state.rvc.infer_file, input_path, output_path)

Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

app.state.rvc.infer_file(input_path, output_path)

await asyncio.to_thread(app.state.rvc.infer_file, input_path, output_path)

coderabbitai · 2024-09-21T13:57:06Z

rvc_python/api.py

+
+            app.state.rvc.infer_file(input_path, output_path)
+
+            output_data = tmp_output.read()


Reset the file pointer before reading tmp_output

After writing to tmp_output, the file pointer is at the end of the file. Before reading from it, you need to reset the file pointer to the beginning using tmp_output.seek(0); otherwise, output_data may be empty.

Apply this diff to reset the file pointer:

tmp_output.close() + tmp_output = open(output_path, 'rb') + output_data = tmp_output.read() + tmp_output.close()

Alternatively, seek to the beginning before reading:

tmp_output.seek(0) output_data = tmp_output.read()

Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

output_data = tmp_output.read()

tmp_output.seek(0)

output_data = tmp_output.read()

coderabbitai · 2024-09-21T13:57:06Z

rvc_python/api.py

+            return Response(content=output_data, media_type="audio/wav")
+        except Exception as e:
+            logger.error(e)
+            raise HTTPException(status_code=500, detail=f"An error occurred: {str(e)}")


Chain exceptions when raising HTTPException

When raising a new exception within an except block, it's good practice to chain it using from e to preserve the original traceback. This helps in distinguishing exceptions in error handling from those in the main code.

Apply this diff to chain exceptions:

logger.error(e) - raise HTTPException(status_code=500, detail=f"An error occurred: {str(e)}") + raise HTTPException(status_code=500, detail=f"An error occurred: {str(e)}") from e

Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

raise HTTPException(status_code=500, detail=f"An error occurred: {str(e)}")

raise HTTPException(status_code=500, detail=f"An error occurred: {str(e)}") from e

Tools

Ruff

159-159: Within an except clause, raise exceptions with raise ... from err or raise ... from None to distinguish them from errors in exception handling

(B904)

TheMhv added 10 commits September 21, 2024 09:24

Feat: Create a tts api route

f59a4ee

Fix: Request parameters for tts

889ac0f

Feat: Add more params into tts request

109eebf

Feat: Add more params into tts request

56699c1

Fix: Params from tts request

80ea872

Fix: Params from tts request

a134dd1

Fix: Params from tts request

3c650ba

Feat: TTS convert directly from request

f2b1d0a

Feat: List voices request

381f395

Fix: async function error

593eebb

TheMhv changed the title ~~API request for generate audio using Microsoft Edge's online text-to-speech service~~ Add API request for generate audio using Microsoft Edge's online text-to-speech service Sep 21, 2024

coderabbitai bot reviewed Sep 21, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add API request for generate audio using Microsoft Edge's online text-to-speech service #31

Add API request for generate audio using Microsoft Edge's online text-to-speech service #31

Uh oh!

TheMhv commented Sep 21, 2024 •

edited by coderabbitai bot

Loading

Uh oh!

coderabbitai bot commented Sep 21, 2024 •

edited

Loading

Chat

CodeRabbit Commands (Invoked using PR comments)

Other keywords and placeholders

CodeRabbit Configuration File (`.coderabbit.yaml`)

Documentation and Community

Uh oh!

coderabbitai bot left a comment

Uh oh!

coderabbitai bot Sep 21, 2024

Uh oh!

coderabbitai bot Sep 21, 2024

Uh oh!

coderabbitai bot Sep 21, 2024

Uh oh!

coderabbitai bot Sep 21, 2024

Uh oh!

coderabbitai bot Sep 21, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

	voice: str \| None = "Microsoft Server Speech Text to "
	voice: str \| None = "en-US-AriaNeural"

	app.state.rvc.infer_file(input_path, output_path)
	await asyncio.to_thread(app.state.rvc.infer_file, input_path, output_path)


		app.state.rvc.infer_file(input_path, output_path)

		output_data = tmp_output.read()

	output_data = tmp_output.read()
	tmp_output.seek(0)
	output_data = tmp_output.read()

	raise HTTPException(status_code=500, detail=f"An error occurred: {str(e)}")
	raise HTTPException(status_code=500, detail=f"An error occurred: {str(e)}") from e

Add API request for generate audio using Microsoft Edge's online text-to-speech service #31

Are you sure you want to change the base?

Add API request for generate audio using Microsoft Edge's online text-to-speech service #31

Uh oh!

Conversation

TheMhv commented Sep 21, 2024 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Sep 21, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram(s)

Chat

CodeRabbit Commands (Invoked using PR comments)

Other keywords and placeholders

CodeRabbit Configuration File (.coderabbit.yaml)

Documentation and Community

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Sep 21, 2024

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Sep 21, 2024

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Sep 21, 2024

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Sep 21, 2024

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Sep 21, 2024

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

TheMhv commented Sep 21, 2024 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Sep 21, 2024 •

edited

Loading

CodeRabbit Configuration File (`.coderabbit.yaml`)