Banner for TIGER for DiffSinger

Welcome

Hi, I'm Tyler! I'm a vocal synth fan/developer, python coder, and more! You'll find all of my projects and resources here!

What's New?

"TRITON for DiffSinger" has been released!

As of 6/30/24, TRITON for DiffSinger has been released! He is a male "baritenor" English-focused DiffSinger vocal with 4 voice modes! Check out his demo to the right, and download him from his page here!

Project Lineup

Banner for TIGER for DiffSinger
Banner for TRITON for DiffSinger
Banner for DANROU for DiffSinger
Banner for Canary for DiffSinger
Banner for Miyo for DiffSinger
Banner of Leif for DiffSinger
Banner for LabelMakr
Sequence File Repository
Banner for TIGER for DiffSinger

Image of TIGER

Illustration by Aquiboni

TIGER's Logo

TIGER is an energetic, happy-go-lucky 20-something year old tiger-man. He's constantly lost in his head, thinking about what songs he's going to skate to at his roller-rink and what games he's gonna play on his Sega Dreamcast. He has a little buddy named 'Tigrito', who is a virtual pet that can pop out of his Tamagotchi-like device!TIGER was designed by the amazing Static and his official images were illustrated by the astounding Aquiboni.TIGER is a singer on the DiffSinger SVS Engine. While most of his data is recorded in English, he has support for Japanese and phonemes outside of English like Spanish and Portuguese! You can utilize TIGER in the OpenUTAU engine. To learn more about using TIGER, please read this document hosted on his Github page.You will find audio samples of each voice mode below. Please note, the current samples are not rendered from the current build of the voice, so they sound less complete than the current version.Character Information:
Age: 25-29 | Height: 5'7 (5'9 on skates) | Weight: ~250lbs | Nationality: American | Occupation: Performer/Roller Rink Owner
Voice Information:
Provider: tigermeat | Data: ~2.5hrs | Languages: EN, ES, FR, JP, ZH

To download TIGER, click the button above that says "DOWNLOAD". If you need some pointers for installing, check out the usage guide by clicking the button above that says "USAGE GUIDE".


OpenUTAU for Lunai

I recommend using all TGM DiffSinger libraries with "OpenUTAU for Lunai", a fork of OpenUTAU made specifically for DiffSinger. You can download it at the link below! You can read more on Lunai Project's Website!

Image of TIGER's Physical Version

Physical Version

Have you ever wanted to put TIGER on your shelf? Well, now you can!You can purchase TIGER's physical version for $36 USD + Shipping at the page linked below! It comes with his box, some stickers, and a unique serial-number with his commercial license. If you'd like to purchase his commercial license separately, you may purchase it for $30 USD below as well.

Image of TIGER's Physical Version

Boxart by tigermeat


Image of TIGER's MMD Model

MMD MODEL

Thanks to the amazing AlexFloareaVT, TIGER now has an MMD Model! Please see the demonstration below!!


Voice Mode - Fresh

Fresh is the default voice mode for TIGER. It's tone is the most consistent, and is the most dynamic. The high range is bright and has falsetto, while the low range is weaker and airy.
Core Range: A2 to A4 | Falsetto: B4+

Voice Mode - DISCO

Disco deviates from the other voice modes by providing a more soft and mellow tone, without being too soft.
The high, middle and low ranges all have a soft but not too airy tone.
Core Range: A2 to A4 | Falsetto: B4+

Voice Mode - Electric

Electric is a shocking and powerful voice mode. The high range can hit much higher notes before switching to a falsetto, and the middle range is bright and sharp.
Core Range: B2 to C5 | Falsetto: C#5+

Voice Mode - Vinyl

Vinyl is TIGER's most unique voice mode, having a booming and mature tone. With a much greater focus on strong deep lows, this voice mode excels with musical-esque songs.
Core Range: G#2 - G#4 | Falsetto: A4+

Voice Mode - Mystic

Mystic is a new and more experimental voice mode meant to be the polar opposite to Electric: extremely soft and whispered. It currently lacks data and will generally be unstable.
Core Range: B2 - C4 | Falsetto: C4+

Voice Mode - Glam

Glam is a resonant power voice mode that pushes the limits of what TIGER can do! It's a loud rock type voice, with crazy belts and very lose, quirky pronunciation.
Core Range: C3 - C5 | Falsetto: N/A

Voice Mode - Royal

Falsetto is a supplemental voice mode, giving you more control over when TIGER's falsetto begins. It's more consistent in tone compared to the falsetto automatically activated in other voice modes. In the sample, it sounds like a normal voice due to the lack of lower data and parallel machine learning.Strongest Range: F4+

XLS - Cross Language Synthesis

TIGER is an English voice library for DiffSinger, but supports extra phonemes for other languages! Below are some examples of TIGER singing in languages other than English!To see a chart of the phonemes TIGER supports, please check out this page!

Japanese | 日本語

Spanish | Español

French | Français


TIGER © 2021-2024 tigermeatCredits:
- TIGER Voice Provider: tigermeat
- TIGER Design: tigermeat & StaticOceans
- TIGER Original Standing Art: Aquiboni
- TIGER Recording, Development, Labels & Maintenance: tigermeat
- Microphones Used: Neumann TLM 103, AT4040, Rode NT1 Signature
- TIGER Logo: tigermeat
- TIGER MMD Model: AlexFloareaVT
TGM databases are trained with "Millefeuille" to provide support for French.

Image of TIGER

Illustration by JulieRaptor

TRITON's Logo

TRITON is a fun, somewhat aloof 20-something year old human & mermaid hybrid. He spends his days tending to his manatee farm off of the coast of the beach him and his partner TIGER live on. His right hand man is a rowdy manatee named Sheriff Spaghetti, and the two love tending the manatee farm together!TRITON was designed by the amazing Static & tigermeat, and his official images were illustrated by the amazing JulieRaptorTRITON is a singer on the DiffSinger SVS Engine. TRITON is recorded in English only, but by utilizing the Cross-Language compatibility of the TGM Dataset, he can sing in many different languages! You can utilize TRITON in the OpenUTAU for Lunai editor. To learn more about using TRITON, please read this document hosted on his Github page.You will find audio samples of each voice mode below.Character Information:
Age: 25-29 | Height: 5'6 | Weight: ~230lbs | Nationality: American Merman | Occupation: Manatee Farmer
Voice Information:
Provider: Ryan M. | Data: ~1hr | Languages: EN Only

To download TRITON, click the button above that says "DOWNLOAD". If you need some pointers for installing, check out the usage guide by clicking the button above that says "USAGE GUIDE".


OpenUTAU for Lunai

I recommend using all TGM DiffSinger libraries with "OpenUTAU for Lunai", a fork of OpenUTAU made specifically for DiffSinger. You can download it at the link below! You can read more on Lunai Project's Website!

Image of TIGER's Physical Version

Voice Mode - Gale

Gale is the default voice mode for TRITON. It's tone is the most consistent, and is the most dynamic. The voice is very consistent in range, and his pronunciation is loose and indie.
Core Range: F2 to E4 | Falsetto: G4+

Voice Mode - Deluge

Deluge is the most unique voice mode for TRITON. While it's not very stable, it has the most complex tone of all voice modes. It's very weak, but not fully soft. The pronunciation is very loose, which works well for indie music & ballads.
Core Range: F2 to E4 | Falsetto: E4+

Voice Mode - Tempest

Tempest is the most powerful voice mode for TRITON. It has a solid tone, with much more power in the high range. It gives a lot more kick than TRITON's other voice modes.
Core Range: A2 to G4 | Falsetto: A#4+

Voice Mode - Flurry

Flurry is a falsetto-focused voice mode for TRITON. It was recorded almost fully in falsetto, but maintains a very muted and soft lower range. It has a full sound that TRITON's other voice modes do not.
Core Range: F2 to D5 | Falsetto: D4+

XLS - Cross Language Synthesis

TRITON is an English voice library for DiffSinger, but supports extra phonemes for other languages! Below are some examples of TRITON singing in languages other than English!NOTE: TRITON's XLS results may be harder to understand compared to other TGM voice's as TRITON is recorded strictly in English.To see a chart of the phonemes TRITON supports, please check out this page!

Japanese | 日本語

Spanish | Español

French | Français


TRITON © 2021-2024 tigermeatCredits:
- TRITON Voice Provider: Ryan M.
- TRITON Design: tigermeat & StaticOceans
- TRITON Original Standing Art: JulieRaptor
- TRITON Recording, Development, Management, Labels & Maintenance: tigermeat
- Microphones Used: AT4040
- TRITON Logo: tigermeat
TGM databases are trained with "Millefeuille" to provide support for French.


Image of Canary

Illustration by Angela M. Chong.

Canary's Logo

Many years ago on a fateful day, Canary randomly arrived on Earth. He spends his days hanging out with robots and his boyfriend who happens to be a zombie.Canary is a singer on the DiffSinger SVS Engine. While most of his data is recorded in English, he has can sing in Japanese and Spanish as well! You can utilize Canary in the OpenUTAU engine. To learn more about using Canary, please read this page.You will find audio samples of each voice mode below.Character Information:
Age: At least 200 (appears 20) | Height: 5'8 | Weight: Varies | Nationality: Space Alien
Voice Information:
Provider: Mina Moonrise | Data: ~1.5hrs | Languages: EN, ES, JP, ZH

Canary is a member of the JAE VOCAL PROJECT, and was developed in collaboration with them and Mina Moonrise. Please click the "JAE PAGE" button above to visit their website and learn more about their projects and even more about Canary!To download Canary, click the button above that says "DOWNLOAD". If you need some pointers for installing, check out the usage guide by clicking the button above that says "USAGE GUIDE".


OpenUTAU for Lunai

I recommend using all TGM DiffSinger libraries with "OpenUTAU for Lunai", a fork of OpenUTAU made specifically for DiffSinger. You can download it at the link below! You can read more on Lunai Project's Website!

Image of TIGER's Physical Version

Voice Mode - Arc

Arc is Canary's standard vocal mode. Arc is a bubbly, warm baritone voice with a lot of versatility. You'll be surprised at how low he can go!Best Genre's: Many, including Pop, Ballads, EDM and more!Core Range: TBA | Falsetto: TBA

Voice Mode - Spark

Spark is Canary's softest vocal mode. Spark is an airy, muted baritone voice with a ton of personality.Best Genre's: Ballads, Jazz, EDMCore Range: TBA | Falsetto: TBA

Voice Mode - Voltage

Voltage is Canary's strongest vocal mode. Voltage is a bright and shocking baritone voice with focus on crisp high notes.Best Genre's: Rock, Dance, EDMCore Range: TBA | Falsetto: TBA

XLS - Cross Language Synthesis

Canary is an English voice library for DiffSinger, but supports extra phonemes for other languages! Below are some examples of Canary singing in languages other than English!To see a chart of the phonemes Canary supports, please check out this page!

Japanese | 日本語

Spanish | Español

French | Français


Canary © 2019-2024 Mina Moonrise, JAE VOCAL PROJECT
Canary DS © 2023-2024 Mina Moonrise, tigermeat
Credits:
- Canary Voice Provider: Mina Moonrise
- Canary Design: Mina Moonrise & Crusher
- Canary Key Standing Art: Angela M. Chong
- Canary Recording: Mina Moonrise
- Canary for DiffSinger Management, Development, Labels & Maintenance: tigermeat
- Microphones Used: AT2020
- Canary Logo: tigermeat
TGM databases are trained with "Millefeuille" to provide support for French.


Image of DANROU

Illustration by Guillotama.

DANROU's Logo

DANROU [d aa n r ow] is an UTAU character from 2013-2015 created by giraffeyyyy, aka tigermeat. DANROU is a young, excitable dragon learning how to navigate the world. He can be a bit reckless, but he's always has a pure heart.DANROU is now a singer on the DiffSinger SVS Engine. His voice was recorded between 2013 and 2015 for UTAU, now ported to DiffSinger. You can utilize DANROU in the OpenUTAU for Lunai engine. To learn more about using him, please read this page.You will find audio samples of each voice mode below.Character Information:
Age: 10 | Height: 4"3' | Weight: 120lbs | Nationality: Fluffy Dragon
Voice Information:
Provider: tigermeat | Data: 23 mins | Languages: JA


OpenUTAU for Lunai

I recommend using all TGM DiffSinger libraries with "OpenUTAU for Lunai", a fork of OpenUTAU made specifically for DiffSinger. You can download it at the link below! You can read more on Lunai Project's Website!

Image of TIGER's Physical Version

Voice Mode - Standard

Standard is based on DANROU's original VCV UTAU voicebank. It has a punchy tone with a cute twang.Best Genre's: Many, including Pop, Electronic, VOCALOID & more!

Voice Mode - Soft

Soft is based on DANROU's original Soft VCV UTAU voicebank. It has a airy, sweet tone and pairs well with Standard.Best Genre's: Many, including Pop, Ballads, VOCALOID & more!

Voice Mode - Light

Light is based on an unreleased 2-pitch CV UTAU voicebank for DANROU. It has a young and bright tone.Best Genre's: Many, including Pico-Pop, Electronic, VOCALOID & more!

XLS - Cross Language Synthesis

DANROU is a Japanese voice library for DiffSinger, but supports extra phonemes for other languages! Samples are coming soon.


DANROU © 2013-2024 tigermeat
DANROU DS © 2024 tigermeat
Credits:
- DANROU Voice Provider: tigermeat
- DANROU Design: tigermeat & Guillotama
- DANROU Recording: tigermeat
- DANROU for DiffSinger Management, Development, Labels & Maintenance: tigermeat
- Microphones Used: Blue Yeti
- DANROU Logo: tigermeat
TGM databases are trained with "Millefeuille" to provide support for French.


Image of Canary

Illustration by ひやま

Canary's Logo

Miyo is a nonbinary rabbit yao who strives to be the best helper in the world! They are extremely benevolent and selfless to the point of hardly ever taking time for themselves, even if risking burnout.Miyo is a singer on the DiffSinger SVS Engine. Miyo was recorded to sing in English, Japanese and Chinese, but can sing in many languages with XLS! You can utilize Miyo in the OpenUTAU for Lunai Editor. To learn more about using Miyo, please read this page.You will find audio samples of their voice below.Character Information:
Age: Unknown (Appears 24) | Height: 5'5" | Weight: Secret | Species: Rabbit Yao
Voice Information:
Provider: 実偽Migi | Data: ~90min | Languages: EN, JA, ZH

To download Miyo, click the button above that says "DOWNLOAD". If you need some pointers for installing, check out the usage guide by clicking the button above that says "USAGE GUIDE".


OpenUTAU for Lunai

I recommend using all TGM DiffSinger libraries with "OpenUTAU for Lunai", a fork of OpenUTAU made specifically for DiffSinger. You can download it at the link below! You can read more on Lunai Project's Website!

Image of TIGER's Physical Version

Here's a sample of Miyo's Alpha voicebank singing "King of Anything" by Sarah Bareilles.Cover: tigermeat & Mina Moonrise
Video: Purpled
Artwork: Mame
Note: This sample uses outdated recordings. The final version of Miyo's voice has been rerecorded.


Miyo © 2023-2024 実偽Migi
Miyo for DiffSinger © 2024 tigermeat, 実偽Migi
Credits:
- Miyo Voice Provider: 実偽Migi
- Miyo Design: TSaianda (maid, butler), Fiorrie (traditional)
- Miyo Key Standing Art: ひやま
- Miyo Recording: 実偽Migi
- Miyo for DiffSinger Management, Development & Maintenance: tigermeat
- Miyo for DiffSinger Labels: tigermeat & FerretFather
- Microphones Used: AT2020
- Miyo Logo: リノ
TGM databases are trained with "Millefeuille" to provide support for French.

Image of Leif

Illustration by FerretFather

Leif's Logo

Leif was created in collaboration with FerretFather and tigermeat. He is recorded in English and Japanese, and comes with 4 voice modes: Blossom, Petal, Uprooted and Lush. They are split into 8 different modes however, each voice mode has an English and Japanese version.Description from FerretFather: "Being quite literally nothing but a mass of plants, be can seem quite aloof or unfocused. His behavior, while vaguely human, can seem awkward and unusual. He does go out of his way however to display acts of kindness and cares deeply for those in close proximity to him. Depending on his perceived mood, he can sprout different colors of flowers in his beard. Being comprised of nothing but plants, Leif technically does not have a gender, but uses he/they pronouns and seems to be attracted to men."

Please take a listen to this demonstration of Leif's Voice!


OpenUTAU for Lunai

I recommend using all TGM DiffSinger libraries with "OpenUTAU for Lunai", a fork of OpenUTAU made specifically for DiffSinger. You can download it at the link below! You can read more on Lunai Project's Website!

Image of TIGER's Physical Version

Leif © FerretFatherLeif's voice and image is intellectual property of FerretFather.

Banner for LabelMakr

LabelMakr is a GUI tool to assist you in creating phoneme-level transcriptions for sung audio data. LabelMakr utilizes Whisper by OpenAI and SOFA by qiuqiao to automatically transcribe and force-align at a word and phoneme level! LabelMakr has a built in transcription editor so you can make sure the generated transcriptions are accurate!To use LabelMakr, click "download" above to download the portable Windows version, or download the source code from the "Github" link above!

A screenshot of LabelMakr.

A screenshot of LabelMakr and the built in transcription editor.

Sequence File Repository

On this page, you'll find a repository of all of my released sequence files. There's all sorts, so take a look! Each is notated with which types it includes, but if you need to do any conversion please check out Utaformatix! It's a very useful tool for converting voice synth sequences.All downloads are via Google Drive below.By using any sequence files, you agree to the following terms and conditions:

  • You will provide credit to me (tigermeat) and any other parties listed in the "readme" file of the specific file. Likewise, you will not claim that you created them.

  • You will NOT redistribute edits/other formats of my files. No exceptions.

  • You MAY tune, edit, or change any part of the files, so long as it can not be construed as offensive or targeted (ex: Offensive parody version of a song, etc.)


'tigermeat character' Terms of Use

Usage of all TGM Characters image, name and voice models of any kind fall under a CC BY-NC-ND 4.0 & Commons Clause License. This page will further explain the license, as well as introduce terms and conditions for usage of a tigermeat character's name/image. Any questions or concerns can be brought to me directly by emailing me at [email protected].

Section 1: Standard Usage

  • Upon downloading any image or voice model of a TGM Character, you agree to all terms written on this page, as well as outlined in the CC BY-NC-ND 4.0 & Commons Clause License.

  • Any use of a TGM character cannot under ANY circumstance promote hate, racism, homophobia, sexism, transphobia, misogyny, or any type of negativity towards a group of people with intentions to harm.

  • TGM Characters may not be used to portray any illegal activity under United States law, and the law of the country the TGM Character is being used in.

  • TGM Characters shall not be portrayed as any race other than the race they appear to be in official illustrations & especially character reference sheets. Please do not change the complexion of the character drastically. Lighting changes are fine, but please be respectful of the characters race.

  • R18, NSFW or Explicit work is acceptable for the following TGM Characters: TIGER, TRITON. You may use these characters names, images or voice models in NSFW works so long as the actions portrayed in said content is legal under the laws of the United States of America AND the home country of the user, if they are not residing in the United States of America. If you are unsure if your work is allowed because of NSFW content, please contact tigermeat.

  • Please do not use TGM Characters to spread harmful misinformation, such as uninformed or bias news and science.

  • You ABSOLUTELY MAY use TGM Characters to do any of the following: Spread love, speak on social issues so long as they do not spread or condone hate to any group, inform others on events, and much more!

Section 2: Code Usage

  • As is the nature of user-focused voice AI, there is some code involved. Please follow all rules outlined in the CC BY-NC-ND 4.0 & Commons Clause License when utilizing TGM Character voice models for any purpose.

  • TGM Character voice models can be used as a warm start for training other models, if applicable to the type of voice model (Not available for DiffSinger due to models being exported to ONNX for use in OpenUTAU) so long as proper attribution under CC BY-NC-ND 4.0 is provided.

Section 3: Commercial Use

  • Commercial usage of TGM Characters is acceptable so long as a license is purchased from tigermeat. To acquire the license, click the "Commercial License" button on each respective character's page. You can purchase the license as a digital-only delivery item, or purchase the physical version of a voice if it exists.

  • The usage of all TGM Characters names, images, and voice models in any type of Crypto Currency or NFT work is STRICTLY PROHIBITED and will not be tolerated under ANY circumstances.

  • TGM Character license fees for commercial use falls under any usage that would generate revenue for or by any means. This includes charity, however TGM Character licensing fees may be discounted/waved for charities or carity events. Please email tigermeat to find out more if you're interested!

Section 4: Visual Usage

  • In using TGM Character images, you agree to publicly state the name of the illustrator of said image. Any official image of TGM Characters will clearly state the illustrator either in supporting documents or the file name.

  • In using TIGER's official illustrations by Aquiboni, you must state they were illustrated by Aquiboni in the published work with no exceptions. The illustrations cannot be edited or changed significantly or stated/implied to be a separate character. (EX: fan-made derivative)

  • The usage of any of TGM Character's visuals for use in generative image AI model training is STRICTLY PROHIBITED and will not be tolerated under ANY circumstances.

'Canary' Terms of Use

This section is pulled directly from Canary's page on the JAE website. For the most current information, please check out this page.

Section 1: General Terms

  • Do not claim Canary or official artwork as your own.

  • NSFW content is allowed. This does not apply to MMD models and other works from people who are not from the JAE VOCAL PROJECT group. Please follow the rules of the creators! If their works are not allowed to be used for NSFW content, then do not use them for NSFW content!

  • For commercial use of his voicebanks and character, permission is required.

  • Derivatives must be made with permission.

  • Oto.ini editing is allowed.

  • Do not edit Canary's samples.

  • Shippings and pairings are allowed, but do not force as canon. Do not ship Canary with characters under the age of 18.

  • Do not use Canary's character and voicebanks for hate speech.

  • Political use is not allowed.

  • Voicebank redistribution is forbidden.

  • Unofficial roleplay, accounts, and ask blogs are allowed if permission from Mina Moonrise is given. Must credit JAE VOCAL PROJECT and Mina Moonrise for character rights. Must credit the appropriate artists if using official. Do not use art from non-members without their permission. Cannot violate existing ToS (no shipping with minors, hate speech, etc). For any questions, please contact JAE VOCAL PROJECT or Mina Moonrise.

  • Regarding AI creations: Creating the character of Canary through AI artwork generation applications and software is prohibited. Using artwork of Canary to train AI artwork generation applications and software is prohibited, especially without the explicit consent of both JAE VOCAL PROJECT and the original artist. Using Canary's UTAU and DeepVocal voicebanks to train and create, use, and distribute AI voices and voicechangers is prohibited without explicit consent of both JAE VOCAL PROJECT and Mina Moonrise.

Section 2: Code Usage (Specific to Canary DS Only)

  • As is the nature of user-focused voice AI, there is some code involved. Please follow all rules outlined in the CC BY-NC-ND 4.0 & Commons Clause License when utilizing TGM Character voice models for any purpose.

  • TGM Character voice models can be used as a warm start for training other models, if applicable to the type of voice model (Not available for DiffSinger due to models being exported to ONNX for use in OpenUTAU) so long as proper attribution under CC BY-NC-ND 4.0 is provided.

TGM Phoneme Guide

On this page, you'll find a guide for using the "TGM" Phonetic system. English phonemes are standard arpabet, but to provide support for other languages, there are some extra phonemes. I'll be providing equivalents to other DiffSinger standard systems, as well as their X-Sampa or V-Sampa equivalent. NOTE: V-Sampa is my shorthand for Vocaloid X-Sampa, since it differs from standard X-Sampa in some cases. The English X-Sampa examples include Standard X-Sampa, and V-Sampa where it differs from standard X-Sampa.NOTE: All fully supported languages, which are listed here, can be used without the need for manual phonetic editing by using the respective Phonemizer in OpenUTAU. This chart is purely for those interested in manually editing phonetics while using TGM voices.To learn more about X-Sampa, please check out the Wikipedia page here!

Base English Phonemes

TGMArpabetX-SampaTGMArpabetX-SampaTGMArpabetX-Sampa
aaaaa, Qbbb, bhsss
aeae{chchtSshshS
ahahVddd, dhttt, th
aoaoOdrd rdr\, dZr\ththT
awawaU, {UdhdhDtrt rtr\, tSr\
axax@fffvvv
ayayaI, AIhhhhh, h\www
ehehEggghyyj
erer@`, @rjhjhdZzzz
eyeyeIkkk, khzhzhZ
ihihIlll, 5qq, cl?
iyiyi, i:mmmdxdx4
owow@U, oUnnn   
oyoyOI, oIngngN   
uhuhUppp, ph   
uwuwu, u:rrr\   

General "Special" Phonemes

TGMX-SampaNotes
SPN/ASilence
APN/ABreath/Inhale
cl_}Diacritic for removing the "pop" from an ending plosive consonant.
vfN/AVocal Fry
bkN/AVoice Break

French Phonemes

The French system used in TGM is based on "Millefeuille." Read more about it on the Github repo here!

TGMMILX-SampaTGMMILX-SampaTGMMILX-Sampa
aahabbbzzz
eeehedddzhjZ
ehaeEfffyyj
axee@gggwww
oeoe2kkkuyuyH
iyihilgll   
ooohommm   
ohooOnnn   
uuouuppp   
yuuhyrhrR   
aanenA~sss   
ehninE~shshS   
ynun2~ttt   
onono~vvv   

In TGM, the word "bonjour" would be b on zh uu rh, but in Millefeuille it would be b on j ou r.

Spanish Phonemes

The Spanish system in TGM is based off of the way Gianloop's TISD is labelled.

ESTGMV-SampaESTGMV-SampaESTGMV-Sampa
padreaaperropxptuyotxt
eneroeeequisekxkbestiabb
míoiyiobtusobvBdedodd
focoooodedodhDgatogg
musauuutrigogvGconchachtS
ciudadyjfaseffcerro*thT
huevowwcasassxicohxx
muyyIsinnnañony/n yJ
neutrowUlanalglcarorsr
polloyL/j\carrorrrr   

*: European Spanish

In TGM, the word "Perro Salchicha" would be [px ee rr oo] [s a lg ch iy ch a], but in V-Sampa it would be [p e rr o] [s a l tS i tS a].

Japanese Phonemes

The Japanese system in TGM is a custom triphonic system.

TGMV-SampaTGMV-Sampa
aann
iyin yJ
uxMhhh / h\
eeehh yC
ooofpp\
nnN/N/N`fp yp'
kxkbb
k yk'b yb'
ggpxp
g yg'px yp'
ssmm
shSmym'
dzzyj
txtrj4
tx yt'rj y4'
tstsww
chtSdd
dzdzd yd'
q?  

In TGM, the word "熱帯魚" would be n ee q tx a iy g y oo, but in V-Sampa it would be n e ? t a i g' o.

Mandarin Chinese Phonemes

The Mandarin Chinese system used in TGM is a custom triphonic System. This is different from the standard DiffSinger Chinese system, as sounds are broken into two OR three phonemes, instaed of just two.This system is imperfect, and is only meant to allow non-Mandarin speaking singers to sing in Mandarin.

PinyinTGMV-SampaPinyinTGMV-SampaPinyinTGMV-Sampa
aaaiaoia wiAUb-bp
ooooiuio wi@Up-pp_h
eex7ana xna_nm-mm
iiuienex xn@_nf-ff
uuuuiniy xni_nd-dt
v/üyuyianie xniE_nt-tt_h
erer@`uanua xnua_nn-nn
ci/ziizi\unuex xnu@_nl-ll
shiiri`vn/ünyu xny_ng-gk
aiayaIvan/üanye xny{_nk-kk_h
eieyeiangaa ngANh-hxx
aoauAUengex ng@Nj-jyts\
ouow@Uingiy ngiNq-qyts_h
iaiaiaiangia ngiANx-xys\
ia(n)ieiE_ruangua nguANzh-jhts`
uauauauenguex ngu@Nch-chts`_h
uououoongoo ngUNsh-shs`
ve/üeyeyE_riongio ngiUNr-rzz`
uaiw ayuaIz-dztss-ss
ueiw eyueic-tsts_h-nxn_n

In TGM, the sentence " 她在吃水果 (tā zài chī shuĭguŏ)" would be [t a] [dz ay] [ch ir] [sh w ey] [g uo].

Korean Phonemes

The Korean system used in TGM is based on the Korean phonetic system by "CODA-SVS" and the Korean support in the VB's mainly come from Kitane Sno's Korean DiffSinger DB. For more information, please check out this document!

한굴TGMV-Sampa한굴TGMV-Sampa한굴TGMV-Sampa
aaㄱ-ggㅊ-chch
애/에eeeㄲ-kxg'ㅋ-kk
iyiㄴ-nnㅌ-tt
oooㄷ-ddㅍ-pp
uuuㄸ-txd'ㅎ-hhh
eo7ㄹ-rjr-ㄱkqgp
uxM-ㄹㄹ-lxl-ㄴnnp
y ajaㅁ-mm-ㅅtqdp
얘/예y eejeㅂ-bb-ㄹrlrp
y oojoㅃ-pxb'-ㅁmmp
y uujuㅅ-sxs-ㅂpqbp
y eoj7shxsh-ㅇngNp
w aoaㅆ-sss   
w eeueshsh'   
w iyuiㅈ-cc   
w eou7ㅉ-jhc'   

In TGM, the word "한굴" would be hh a n g uu rl, but in V-Sampa it would be h a np g u rp.

Image of TIGER

icon by @bayboyzone

Well Hello There!

Howdy! I'm Tyler. I'm 25, from Chicago and do lots of internet things. I currently am focused hard on developing voices for DiffSinger, but I also code apps and things in Python! If you have a question, or just wanna say hi, feel free to hit me up at any of the places below! You'll see me the most active on twitter, however!

Contact me

DiffSinger Install/Usage Guide

In this guide, you'll find some information on how to install and utilize DiffSinger. The guides to install are specific to TGM Characters, but usage is pretty universal!All guides on this page will be using Windows 11, but the same steps should apply for your personal operating system. As always, if you have any questions, feel free to reach out to me directly!

Step 1: Download Voice + Install

First, you'll need to install OpenUTAU. TGM DiffSingers will work on any fork of OpenUTAU that supports DiffSinger, however I recommend using the main, official version of OpenUTAU. Please follow this link to download OpenUTAU from Github. The Github page has a few guides to ensure you're able to install properly!Next, you'll want to download the voice library. On each characters respective page, you'll want to click "Download", generally the first button at the top of their page. You'll download a file called something like "TIGER_DS_(version#)_PACK.zip". This will include all of the necessary things you need to use the voice.Upon unzipping the "pack" file, you'll see something similar to this screenshot below.

"Voice Library" contains the voice library. Drag the ".zip" file in that folder into OpenUTAU, hit "apply" on the prompt that pops up and once all of the items are unloaded, you'll be able to start using the voice!"OpenUTAU Plugins" contains any custom plugins the singer can use, specifically the TGM English Phonemizer. Drag the ".dll" files into OpenUTAU and click "okay" on the prompt for each. You'll need to restart OpenUTAU to use these phonemizers.

Image of TIGER

After completing the steps above, you're ready to start using DiffSinger in OpenUTAU!

Step 2: Adjust Settings in Preferences

By default, OpenUTAU will try to render DiffSinger with the default settings, which I highly recommend changing. To adjust, in the main window click "Tools > Preferences..."

Scroll down to "Rendering".If you do not have a GPU in your system, or are not using a Windows version of OpenUTAU, you'll want to leave "Machine Learning Runner" as the default. If you have a GPU on Windows, change it to "directml". If you are using a GPU, ensure the proper one is selected in the "GPU" drop-down box.Ensure "DiffSinger Render Depth" is set to 400, and you can change "DiffSinger Render Speedup" depending on your preference.For DDSP Models:
Higher Quality = Lower Numbers
Lower Quality = Higher Numbers
"Prefer Quality" = 1
"Prefer Speed" = 100
For Reflow Models (TGM v106 and up):
Higher Quality = Higher Numbers
Lower Quality = Lower Numbers
"Prefer Quality" = 100
"Prefer Speed" = 1

Image of TIGER

Step 3: Basic Usage

For using a DiffSinger voice for the first time, you'll want to select the voice from the track and ensure you're using the proper phonemizer!

Image of TIGER

To select the singer, click on "Select Singer" from the track you're using. If you do not see a newly installed singer there, click on "DiffSinger..." at the bottom, and an expanded list of installed singers will appear for you to choose from.

To select the phonemizer, click the line below "Select Singer" after selecting a singer, and choose from a list of Phonemizers. All DiffSinger phonemizers have the prefix "DIFFS". DiffSinger will only work with the special DiffSinger phonemizers. TGM Voices work best with "DIFFS EN" or "DIFFS EN TGM", but all voices will support "DIFFS", "DIFFS ES", and the yet-to-be-released "DIFFS JA". To use voices in Japanese, currently support is available via "DIFFS" by using Hiragana lyrics.

Image of TIGER

Step 4: Editing the Voice

This section will cover utilizing DiffSinger in the most efficient way possible, like directly editing phonemes, using the built in pitch model, and putting the voice modes to work!

Image of TIGER

Above is a screenshot of the OpenUTAU musical part editor. This is where the magic happens! You can see a few different things; the control bar at the top, the notes, the phoneme timings and the parameter bar at the bottom. These are the big 4 things you'll be looking at while using DiffSinger.You'll type your lyrics in the notes you see or enter in the part, but what if one part isn't said the way you'd like it to be? You may notice in the screenshot above that some of the lyrics are inside of square brackets "[ ]". Any notes with those brackets will be read as phonemes as opposed to lyrics. You can see how the note with "[f aa]" has the phonemes "f" and "aa" in the phoneme timing section.

Image of TIGER

If you'd like to load pitch from the built in pitch-tuning model, select the line you'd like to render pitch on and click "Batch Edit > Notes > Load Rendered Pitch

After clicking that, you'll notice that there is a pitch curve drawn on top of the notes you selected!

Image of TIGER

If you want to make any changes to the pitch, you'll click the "Draw Pitch Tool" at the top (pencil with the line, right of the eraser) or press "4" on your keyboard.

Image of TIGER

Next, let's play around with the voice modes! To active them in the parameter editor, you'll click the gear icon at the bottom left.

Image of TIGER

After clicking the gear, select "Add all expressions suggested by renderers" and click "Apply". This will allow you to use all voice modes as curves, but also things like the proper GEN curve, proper VELC curve and pitch expressiveness.

After doing so, you'll be able to select any parameter from the drop down and use it as a curve! You can get some very expressive singing by mixing parameters and voice modes together this way!

Image of TIGER

Step 5: Conclusion

To conclude, there's a ton of different ways to use DiffSinger, so play around and find out what works best for you! If you have any questions, or run into any issues, please do not hesitate to reach out for me. My contact information can be found below!