Home - LuciozUndiezz/VocaloidDBTool-SAK GitHub Wiki

Welcome to the Canned-Bread’s VOCALOIDDBTOOL Swiss Army Knife wiki!

Here you’ll see how to use this program and how to make a HOMEBREW Vocaloid DB!

This will be separated into different pages! Please beware of updates to this wiki page!

SEE FAQ BRFORE YOU ASK ME ANYTHING (not made yet). If you have any further questions you can DM me on Reddit (u/jamburgersandwitch) or on Discord (bread_in_a_can)

Step one, make an utau voicebank!

You can probably find a good place googling it, but if you just wanna make a port you can skip this since you already have one

Step two, .trans generation!

See here for more details

Step three, .seg generation!

See here for more details

Step four, making the voicebank folder!

Open the devkit (you'll have to find it yourself or have someone give it to you, sorry)

Click on Screenshot 2023-10-09 131942

Click Screenshot 2023-10-09 132019

Click on a dictionary file like this one

image

Click on Screenshot 2023-10-09 132408

Make a new folder and name it your singer name to keep things organized, open that folder and put the singer name in the bottom bar like this

Screenshot 2023-10-09 132633

When this window pops up, click on what language you want vocaloid to recognize your voicebank as.

Screenshot 2023-10-09 132658

There’s only these three, but if you are creative enough, you can make any language you’d like, like Russian or Polish. You just need to edit the dictionary file you load in and its phonemes.

Create an articulations folder and a stationary folder (you'll need these because you have two different kinds of files)

image

The articulations folder should have this:

.trans (the ones you made with Auto-trans)

.seg (the ones you get from lab2seg)

.wav (your wav files you get from the genon2db work fine)

it should look like this:

image

The stationary folder should have this:

.trans (the ones you get from converting with the trans-convert)

I would say .seg you get from lab2seg but for stationary you're gonna have to segment those yourself sorry

.wav (your wav files you get from the genon2db work fine)

it should look like this (the same image from articulation folder just ignore the .seg files lol):

image

Step five, configuring!

time to rewrite this because I just deleted all of my progress lmfao

From here on out there will be no images because I'm finishing this at work

Open the devkit and hit file

click on Automatic segmentation

In the browser window, find the articulations folder you made and click OK

The next window that pops up, find the devkit folder and click OK

Now you will see the automatic segmenting window

You might need to edit the segments, so click on a .trans file listed in the automatic segmenting window and click View/edit segmentation

To properly segment, click the top GREY bar and drag the RED lines needed.

Lets take an example file (a_i.seg) for example

You need to move the first line (the one with [a] next to it) and move that to the beginning of the a phoneme.

Then, you’ll have to move the line with [i] next to it to the beginning of the i phoneme.

Think of this like highlighting the phoneme from the beginning. You can click the top GREY bar to highlight the space and timing each phoneme.

To save, just close that window. To apply the changes, click on segment articulations.

Most of the time the devkit segments these well, but when it doesn’t, you’ll have to press view articulation segmentation and do the same process you did for the .she editing. This looks different, but it works the same.

To save, close the editor window. To apply, press add articulations to DB.

Once you’re done with all of those, you can edit the stationary.

To do so, go back to file and automatic segmentation whatever its called

Instead of finding your articulations folder, find your stationary folder. Press OK

Find the dev kit folder, press OK

now you see the automatic segmentation window again!!

Now you’ll have to make the .seg files manually.

To do so, click on segment phonemes

Now you can open the waveform editor again and edit the timing. This works a little differently.

It works similarly, but there’s an empty line after each line with a phoneme. This is the end of the phoneme. Move the line so the empty line is at the END of the phoneme. You can have the next phoneme line after this next to this one.