Video Editing & Tone Tuning (Virtual Singer)¶

Important

I am a skilled tone tuner of the virtual singer software UTAU. I upload my work to the site as videos, so yes, I know how to do video editing and bitrate-compressing as well.

I have also purchased the advanced tuning software Synthesizer V and play it around for a while.

Warning

You can check most of my video samples from my Bilibili space. However, some video works are in heavy Kichiku (鬼畜) or spoof style and may cause DISCOMFORT. If you do not know much about this video style, I recommend not trying or at least watching with caution.

Video Editing¶

There is nothing too much to explain about video editing. Video creators who love animation (yes I am one of them) will use images, videos, music or other media to edit and recreate their favorite animation works, and the final video result is called MAD or AMV (technically there is a difference between the two, but I won't go into it here). The purpose of the creation is often to express love for the original work (or the characters or story within it), or to make an adaptation of the original story.

The following is a MAD I created in 2014 using the music One Year, Two Months and Twenty Days (1年2个月又20天) in conjunction with the animation 5cm per Second (秒速五厘米). Unfortunately, the video editing really takes a lot of time and I had little time to create another one after this.

Tone Tuning¶

Many people may not understand what ”Tone Tuning“ is. Simply put, it's about making virtual singers' voices match the beat, pitch, and emotion of a song so that they can sing it.

The most famous virtual singer in the world is Hatsune Miku (初音ミク). If you haven't heard her name, I'm sure you've heard her coverf of Ievan Polkka (Video link). Here is a portrait of Hatsune Miku from the official Crypton website:

Virtual singers only have the sound of individual words or syllables read aloud (without intonation) in their voice bank, so to get them to sing a full song, a technician (i.e., a tuner) must be available to work on the pitch and beat of their pronunciation. In addition to these basic adjustments, to make the final song sound natural, beautiful, and close to a real person singing it, a good tone tuner will also deal with:

Vocal emotion,
Vibrato,
Glissando,
Ornamentation,
Singer falsetto, and many other things.

The voice software (synthesizer) behind Miku is Vocaloid, initially made by Yamaha in 2004. However, Vocaloid has not been called a success until the introduction of Hatsune Miku in 2007. After then, many virtual singers have been released for free (or sold commercially), and they are available for different synthesizers. However, I didn't know much about Vocaloid because its price is too high for amateurs like me.

In the 2010s, the most famous free synthesizer is UTAU and that is exactly the software I am familiar with. I started my tuner hobby in 2014, right after the release of virtual singer Ge Ping. I am grateful for his generosity on the virtual singer, which allows me to explore the fun of music in this way.

Here are some samples tuned by me, covered by the virtual singer Ge Ping (葛平).

One Night in Beijing (北京一夜): The falsetto vocal effect is simulated in the chorus. I tuned this in 2015.
Like Smoke (如烟): Gentle and lyrical singing style. I tuned this in 2016.

You may have noticed that these tone tuning samples are all covered by Ge Ping. Who is him? Here is a brief introduction of the tone tuning history behind him.

Note

In China, Ge Ping (葛平, 1960--), who was famous for being the main voice actor of Chinese animation 3000 Whys of Blue Cat (蓝猫淘气三千问), is an important figure who can't be bypassed and who drives the development of Chinese virtual singers.

Initially, a video of his interview has been spoofed by netizens for a long time. Netizens took pleasure in making fun of his standard pronunciation and interview stories, and sometimes used the video as music material. Instead of getting angry after finding out, Ge Ping said he had always regretted not becoming a singer as well, and wanted to learn new trends from young people and contribute his voice to help fans of virtual singers realize their dreams.

In 2013, Ge Ping recorded his own voice in the studio as a sound source of a virtual singer (UTAU compatible), and released it publicly (Ge Ping Voice Release). He voluntarily gave up his commercial rights and made the virtual singer available to everyone at no cost for tone tuning and music creation. This is also the highest quality Chinese virtual singer voicebank in a long time. Here is the portrait of the virtual singer Ge Ping from UTAU China group (designed by 十月鬼).

A portrait of the virtual singer Ge Ping (Painter: 十月鬼).

After 2019, the strongest synthesizer probably became the paid software Synthesizer V, which also has many Chinese voicebank under it, such as Xingchen (星尘), Chiyu (赤羽), and Cangqiong (苍穹). You can visit the albums of this generation of virtual songstresses (Xingchen/Cangqiong/Shian's First Album - Jungle Rules, in Chinese) to try them out, and you will find that their singing is almost indistinguishable from that of real people.

The level of software technology is extremely powerful, allowing tuners to make virtual singers' singing comparable to real vocal many times easier than it was in the early days of the Vocaloid/UTAU era. I haven't had a chance to try it out yet, but will likely acquire it in the future.