Implement Shion (詩音) from Sing a Bit of Harmony (讓我聽見愛的歌聲) with Python



Nikkie began his career as a software engineer in 2016. He took up Python as a hobby in 2017 and fell in love with it. Since 2019 he has worked on Natural Language Processing as a data scientist at Uzabase, Inc. in Tokyo, Japan.

He contributes to the Python community in Japan as a staff member of the following event:

- PyCon Japan, the largest PyCon in Japan
  - staff in 2019 and 2020 (program committee, lead in 2020)
  - chair in 2021

He has given talks (and lightning talks) at many PyCons in Japan and abroad:

- EuroPython 2020, PyCon APAC 2020 (English)

He loves anime (Japanese animation) as much as Python, and implements anime-related ideas in Python. In 2022, he wrote code related to "Sing a Bit of Harmony" (e.g. a Twitter bot, a prototype AI character, etc.).


    How can we create a program that can speak (not write) with a human? I love anime and fell in love with the movie "Sing a Bit of Harmony" (讓我聽見愛的歌聲). The character Shion, an AI (robot), is very attractive from an engineer's point of view, and I wanted to implement at least some of its functions. I implemented a script that lets a human enter text by voice and responds by voice. In short, it is like a smart speaker that parrots: the program reads aloud the text it hears. I started with an easy implementation (a Web API and an OS command) to check the idea, then reworked it with pre-trained machine learning models to get closer to Shion. I will share those implementations with you, and I would be happy if they provide a little inspiration for your Maker project. Keywords, hashtag style: #TTS, #ASR, #subprocess, #SpeechRecognition, #ttslearn, #ESPnet, #soundfile, #HuggingFace
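To make the "smart speaker that parrots" idea concrete, here is a minimal sketch of the easy variant (a Web API for recognition and an OS command for speech). It is not the speaker's actual code. It assumes the SpeechRecognition package with a working microphone (PyAudio) and the macOS `say` command; the function names and the `ja-JP` default are hypothetical choices made for illustration.

```python
import subprocess
from typing import Callable


def listen_with_speech_recognition(language: str = "ja-JP") -> str:
    """Record one utterance from the default microphone and return its text."""
    # Imported lazily so the rest of the module loads without the package
    # (assumption: SpeechRecognition and PyAudio are installed).
    import speech_recognition as sr

    recognizer = sr.Recognizer()
    with sr.Microphone() as source:
        audio = recognizer.listen(source)
    # recognize_google sends the recorded audio to a Web API.
    return recognizer.recognize_google(audio, language=language)


def build_say_command(text: str) -> list[str]:
    """Build the argument list for the macOS `say` command (assumed TTS backend)."""
    return ["say", text]


def speak_with_say(text: str) -> None:
    """Read the text aloud by running the OS command through subprocess."""
    subprocess.run(build_say_command(text), check=True)


def parrot_once(listen: Callable[[], str], speak: Callable[[str], None]) -> str:
    """One round of parroting: hear text, speak it back, return what was heard."""
    text = listen()
    speak(text)
    return text
```

On a Mac with a microphone, `parrot_once(listen_with_speech_recognition, speak_with_say)` would run one round. Passing the two halves in as functions means either side can later be swapped for a pre-trained model without touching the loop.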






    Day 1 • 13:30-14:00 (GMT+8)


    Japanese talk with English slides