-stellar evolution- (DiffSinger)

Release Date
Recording Languages	English & Japanese
Additional Languages	Spanish + Italian (trained with Gianloop’s dataset + Criss Namine’s dataset); French (trained with UTAU French Resources’ Petite Millefeuille); Korean (trained with CSD, labels by heta-tan); Mandarin Chinese (trained with OpenCPop, labels by heta-tan)
Recording Range	D3 – C6
Voice Colors	Main Sequence, White Dwarf, Nebula, Red Giant, Neutron, Protostar, Supernova, Black Hole

Last updated: July 7th, 2025

Current voicebank version: V7

Page and voicebank are still WIP!

Thank you for your patience!

Stellar Evolution is the process in which a star changes over time. Starting from stellar nurseries such as Nebulae or molecular clouds, matter condenses to form a Protostar. Once the star starts to stably burn hydrogen into helium, it’s considered to be a Main Sequence star, where it will spend about 90% of its life cycle. As a star runs out of hydrogen to convert, its core begins to collapse and the star becomes a Red Giant. Depending on the star’s mass, it can take one of two routes: low mass stars will see their outer layers blow away, creating a planetary Nebula while the remaining core becomes a dense White Dwarf. Larger mass stars can explode into a Supernova, collapsing into either an extremely dense Neutron star or a Black Hole.

Aphelia -stellar evolution- is an AI singing voicebank made using DiffSinger and singing data from their voice provider, heta-tan. -stellar evolution- offers the most realistic and versatile version of Aphelia. -stellar evolution- can officially sing in several languages: English, Japanese, French, Italian, Spanish, Korean, and Mandarin. They also include 8 different voice colors to help them sing a wide range of genres and singing styles.

Change Log

Re-recorded large sections of the voicebank (mainly Main Sequence + Red Giant + Supernova) to reduce reverb
Re-added Black Hole (whisper) voice color
Added sung Japanese recordings to all voice colors
Added more growl recordings
Added more data for range
Trained using multi-dict configuration
Added PJS Japanese database
Added Criss Namine Spanish database
Trained with Muon Lynxnet2
Trained Aphelia_PC_hifigan to have in-house tone shift support

Trained with lynxnet + melody encoder
Re-recorded large sections of Main Sequence and Red Giant to reduce reverb
Re-recorded and added large sections of Supernova to increase quality
Added new phoneme: gr (growl, VERY EXPERIMENTAL!)
Added more data for range
Added Korean support using CSD
Added Mandarin support using OpenCPop
Edits to phoneme system to make it more intuitive

Full Dataset!
Added new phonemes: cr (voice creak), MM (closed mouth humming), NN (open mouth humming) and LL (long L humming?)
Began working on Portuguese replacement dictionary (HEAVY WIP!)
Added DIFF JA support
Updated vocoder (aphelia_nsfgan) to include full dataset
Text tutorial is mostly drafted

Almost all new data
Added Spanish/Italian support thanks to Gianloop
Added French support thanks to UTAU French Resources
dsdict-en.yaml has been added to improve pronunciation and allow for easier use of additional phonemes
Phonemes have been changed
- rr/r (JPN) have been merged and changed to [rx]
- h (JPN) has been changed to [hh] to merge with English data
- jh (JPN) has been changed to [jh] to merge with English data
Custom vocoder (aphelia_nsfgan) has been included
Vocal mode -moon- (Japanese) has been merged with -main sequence-
Vocal mode -supernova- (Rock) has been added
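
If you are updating older Japanese phoneme sequences by hand, the phoneme merges listed above can be sketched as a small lookup table. This is only an illustration: the function name and the list-of-strings input are assumptions, not something shipped with the voicebank or provided by DiffSinger. It covers only the rr/r and h changes, since the jh entry maps to the same symbol.

```python
# Hypothetical helper for illustration; not shipped with the voicebank.
# Mapping taken from the changelog: Japanese rr/r -> rx, Japanese h -> hh.
JA_PHONEME_UPDATES = {
    "rr": "rx",
    "r": "rx",
    "h": "hh",
}


def update_ja_phonemes(phonemes):
    """Return a copy of a Japanese phoneme list using the renamed symbols.

    Only apply this to Japanese sequences: the changelog marks these
    changes as (JPN), so English phonemes are assumed to be unaffected.
    """
    return [JA_PHONEME_UPDATES.get(p, p) for p in phonemes]


if __name__ == "__main__":
    # "haru" written with the old Japanese symbols
    print(update_ja_phonemes(["h", "a", "r", "u"]))  # ['hh', 'a', 'rx', 'u']
```

Symbols that are not in the table pass through unchanged, so the helper is safe to run on sequences that already use the new names.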

Main Sequence is the default voice color for this voicebank and is the most similar to Aphelia’s VCCV English and their -capricornius- / -ophiuchus- banks. Main Sequence tends to have a dark, mature and masculine lower range that transitions to a youthful, sweet, feminine higher range.

Sample of Recorded Genres	Jazz, Pop-Rock, Soul, City Pop, Pop
Recording Range	D3 ~ A5
Recording Amount	1:53:09

ART + DESIGN NOTE DOWNLOADS

White Dwarf is the soft voice color. This voice color is most similar to their -pisces- or -corona-’s whisper expression. It’s a very soft and airy voice with delicate pronunciation while retaining qualities of Main Sequence. This voice works well in tandem with Nebula.

Sample of Recorded Genres	Acoustic, Bossa Nova, R&B, Easy Listening
Recording Range	E3 ~ E5
Recording Amount	0:52:57

V.5 Demonstration

ART + DESIGN NOTE DOWNLOADS

Nebula is a mature, deep, soft voice that can take on an almost operatic approach in its higher register. This voice color is most similar to their -cygnus- or -corona-’s dark expression. This voice is full of emotion and can sometimes take on a sultry quality. You can use this voice color together with Neutron as a soft counterpart.

Designed by Gardanana

Sample of Recorded Genres	Ballads, Folk, Atmospheric, Jazz, Soul, Acoustic, Opera
Recording Range	F3 ~ F5
Recording Amount	0:51:09

V.5 Demonstration

Art and Design notes download

Red Giant is considered the “default power” voice of this voicebank. This voice color is most similar to their -solar flare- base and light expressions. It is a confident and bold voice with clear pronunciation while still retaining qualities of Main Sequence. The higher range for this voice color is very bright and strong with a very cheerful tone. This voice is a more powerful version of Main Sequence and works well blended with Neutron.

Sample of Recorded Genres	Disco, Pop, Funk
Recording Range	G3 ~ B5
Recording Amount	0:49:08

Art and Design Notes Download

Neutron is a powerful complement to Nebula. This voice color is most similar to -stratosphere-’s chest expression. It is a more mature, chest-based vocal compared to Red Giant that can have looser and more exaggerated pronunciation. This vocal is closer to an operatic vocal than to a rock or typical power vocal.

Designed by liv/ricecrispty

Sample of Recorded Genres	Symphonic, Musical Theater, Alternative/Indie, Folk-Rock, Opera
Recording Range	E3 ~ D5
Recording Amount	0:52:03

V4 Demonstration

Art and Design Notes Download

Protostar is a voice-acted, childish and nasal voice color compared to the rest of the voicebank. This voice color is most similar to -corona-’s sweet expression or to Ame Wakana, another UTAU voiced by heta-tan. This expression is mainly for fun, so it may not produce the best vocals for serious projects, but it might be useful in XSY as a pseudo-gender flag or for more cartoony vocals.

Designed by ztarplatinum

Sample of Recorded Genres	Children’s Songs, Bubblegum pop
Recording Range	G3 ~ F5
Recording Amount	0:29:02

V4 Demonstration

Art and Design Notes Download

Supernova is a powerful and playful voice. This voice color is most similar to the -solar flare- yell expression. Unlike Neutron, this voice color is less chest-based, with a much better higher range for belting vocals. Unlike Red Giant, it is more exaggerated and looser in its singing style and pronunciation. Sometimes, this vocal can have a gritty quality, especially in the higher range.

Sample of Recorded Genres	Classic Rock, Vkei, Pop Punk, Hard Rock
Recording Range	G#3 ~ C6
Recording Amount	0:38:58

Art and Design Note Download

Black Hole is a whispering vocal. This is an experimental vocal to see if DiffSinger can handle very airy, nearly whispering vocals with the help of “trash” phonemes. With negative tension values, you can obtain 100% whispering vocals. This voice color is better suited to harmonies, ad-libs and “effect” vocals than to primary vocals.

Sample of Recorded Genres	Children’s Songs
Recording Range	E3 ~ D5
Recording Amount	0:09:31

Art and Design Notes Download

DOWNLOADS

Voicebank (INCOMPLETE ART FOLDER)

ALT DOWNLOAD LINK (Google Drive)

If you are having issues with the MediaFire download, I’ve also uploaded the voicebank to Google Drive!

Bonus Goodies

Exvoice (Google Drive)

Full-Res Art (Google Drive)

Text Tutorial (Google Doc)

What’s Included

Aside from a DiffSinger voicebank that can sing in English, Japanese, French, Spanish, Italian, Korean and Mandarin, Aphelia’s -stellar evolution- package will include the following:

0_Extras
- exVoice Link
  - Additional recordings (EN/JA): laughter, counting, adlibs, etc.
  - ✓ Allowed: covers, creative projects
  - X Prohibited: training AI models (RVC, soVITS, Diff-SVC, etc.)
- Resources
  - Tutorial PDFs
  - Phoneme charts
  - Base dictionary files
Vocal Color Archives
- Each voice includes:
  - Full body art (HD + OpenUTAU-optimized)
  - Icons
  - Logos (DiffSinger / voice color / Stellar Evolution)
  - Additional artwork
  - Reference sheets + design notes

Requests and feedback for new features can be sent through this Google Form

Bank Type	Upload Date	Recording Range	Vocal Modes
Multilingual DiffSinger	July 7th, 2025	D3-C6	Main Sequence, White Dwarf, Nebula, Red Giant, Neutron, Protostar, Supernova, Black Hole
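
As a quick sanity check, the overall range and total recorded time can be worked out from the per-voice-color figures listed in each section above. The sketch below is only illustrative: the helper names are made up for this example, and the note numbering assumes the common convention where C4 is MIDI note 60. With those assumptions, the voice colors together span D3 to C6, matching the Recording Range at the top of the page, and add up to 6:35:57 of recordings.

```python
# Illustrative check only; figures are copied from the voice color sections.
# Each entry: (lowest note, highest note, recording amount as h:mm:ss)
VOICE_COLORS = {
    "Main Sequence": ("D3", "A5", "1:53:09"),
    "White Dwarf": ("E3", "E5", "0:52:57"),
    "Nebula": ("F3", "F5", "0:51:09"),
    "Red Giant": ("G3", "B5", "0:49:08"),
    "Neutron": ("E3", "D5", "0:52:03"),
    "Protostar": ("G3", "F5", "0:29:02"),
    "Supernova": ("G#3", "C6", "0:38:58"),
    "Black Hole": ("E3", "D5", "0:09:31"),
}

SEMITONES = {"C": 0, "D": 2, "E": 4, "F": 5, "G": 7, "A": 9, "B": 11}


def note_to_midi(note):
    """Convert a note name such as 'G#3' to a MIDI number (assumes C4 = 60)."""
    letter, rest = note[0], note[1:]
    sharp = rest.startswith("#")
    octave = int(rest[1:] if sharp else rest)
    return 12 * (octave + 1) + SEMITONES[letter] + (1 if sharp else 0)


def to_seconds(hms):
    """Convert an 'h:mm:ss' string to a number of seconds."""
    hours, minutes, seconds = (int(part) for part in hms.split(":"))
    return hours * 3600 + minutes * 60 + seconds


def summarize(colors):
    """Return (lowest note, highest note, total time) across all voice colors."""
    lowest = min((low for low, _, _ in colors.values()), key=note_to_midi)
    highest = max((high for _, high, _ in colors.values()), key=note_to_midi)
    total = sum(to_seconds(amount) for _, _, amount in colors.values())
    hours, remainder = divmod(total, 3600)
    minutes, seconds = divmod(remainder, 60)
    return lowest, highest, f"{hours}:{minutes:02d}:{seconds:02d}"


if __name__ == "__main__":
    print(summarize(VOICE_COLORS))  # ('D3', 'C6', '6:35:57')
```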

For a multilingual AI voice changer, please refer to their soVITS model!

SoVITS