2026.4.2 - OmniVoice: Xiaomi's Open-Source TTS Model

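1. OmniVoice: Xiaomi's open-source TTS model. It supports multiple languages and voice cloning, and its cloning quality can hold its own against IndexTTS 2. GitHub: https://github.com/k2-fsa/OmniVoice Model: https://huggingface.co/k2-fsa/OmniVoice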

[Video: omniVoice.mp4]

2. ByteDance opens applications for the Seedance 2.0 API. Only enterprise users can apply for now. Application page: https://www.volcengine.com/contact/seedance2-0public

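3. Microsoft releases the speech recognition model MAI-Transcribe-1. Its accuracy is higher than that of 11Labs' (ElevenLabs') Scribe v2. Official announcement: https://microsoft.ai/news/state-of-the-art-speech-recognition-with-mai-transcribe-1/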

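4. Claude Code 2.1.90 adds a tutorial feature. Type /powerup to start an interactive lesson that teaches you how to use Claude Code from inside Claude Code. It is worth going through for beginners and advanced users alike.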

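Banana Pro, direct-access version for mainland China. This is a small product I built. No VPN is needed: enter your key and it works. It is simple to operate and suited to beginners. Once you have bought it, you can let other people use it too, which makes for a nice favor. Every resolution (1K/2K/4K) costs 0.3 RMB per image, which is very good value. You can also use my AI PPT feature alongside it to quickly generate slide decks in the style of big tech companies. Banana Pro is already a tool, while other AI image generators are still toys.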

Use Banana Pro: https://gordensun.github.io/NanoBananaPro/ Use AI PPT: https://gordensun.github.io/NanoBananaProPPT/

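My WeChat official account: AI加速派. It shares cutting-edge tutorials that can be followed directly from within mainland China. I cover the tokens and keys used in the tutorials, so you can run them end to end without even registering an account.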