Guidelines: Transcription, Translation & Voice Over - AI4Bharat/Chitralekha GitHub Wiki

Transcription

Duration

• Subtitles should appear and disappear exactly when the words are spoken.

Length

• For lectures, keep each card to 15 to 20 words. For entertainment content, aim for approximately 14 words per card for easy readability.
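The length recommendation can be checked mechanically before review. The sketch below is only an illustration: the function name, the `WORD_LIMITS` table and the whitespace-based word count are assumptions, not part of Chitralekha, and whitespace splitting may not count words correctly in every language.

```python
# Upper limits taken from the guideline above; the names are hypothetical.
WORD_LIMITS = {"lecture": 20, "entertainment": 14}


def cards_over_limit(cards, content_type):
    """Return (index, word_count) for every card above the recommended limit."""
    limit = WORD_LIMITS[content_type]
    over = []
    for index, text in enumerate(cards):
        # Assumption: words are separated by whitespace.
        count = len(text.split())
        if count > limit:
            over.append((index, count))
    return over


cards = ["Welcome to the course.", " ".join(["word"] * 15)]
print(cards_over_limit(cards, "entertainment"))
```

A card that is flagged is a candidate for splitting at a sentence boundary, not for trimming the spoken content.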

Structure

• Do not merge the end of one sentence with the beginning of the next.

• Ensure sentences have proper punctuation.

Line Treatment

• Text should usually be kept to one line.

• Text should not exceed 2 lines.

Complete Sentence

• Complete a sentence within one text-tab/card where possible. Do not split a sentence unless length or reading speed requires it.

• If the complete sentence exceeds 2 lines, split it across additional text-tabs/cards.

• In cases where a single sentence is to be split into two tabs/cards or two lines, follow these basic principles:

  1. The line should be broken:
    • After punctuation marks
    • Before conjunctions
    • Before prepositions
  2. The line break should not separate linguistic units such as:
    • A noun from an article
    • A noun from an adjective
    • A first name from a last name
    • A verb from a subject pronoun
    • A prepositional verb from its preposition
    • A verb from an auxiliary, reflexive pronoun or negation
  3. Do not break a person’s name or title into separate lines or tabs/cards.
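As a rough illustration of the line-break principles above, the sketch below picks a break point for an English sentence. It is a simplification under stated assumptions: the word lists are small hypothetical samples, only the "noun from an article" rule is enforced, and other languages need their own word lists and rules.

```python
# Small illustrative word lists (assumptions, English only).
CONJUNCTIONS = {"and", "but", "or", "because", "so", "while", "although"}
PREPOSITIONS = {"in", "on", "at", "to", "of", "for", "with", "from", "by", "about"}
ARTICLES = {"a", "an", "the"}
PUNCTUATION = ",.;:!?"


def split_into_two_lines(sentence):
    """Split a sentence into two lines at the best break nearest the middle."""
    words = sentence.split()
    if len(words) < 2:
        return [sentence]
    middle = len(words) / 2
    preferred, allowed = [], []
    for i in range(1, len(words)):  # candidate: break before words[i]
        prev_word = words[i - 1]
        next_word = words[i].lower().strip(PUNCTUATION)
        if prev_word.lower() in ARTICLES:
            continue  # never separate an article from what follows it
        allowed.append(i)
        after_punctuation = prev_word[-1] in PUNCTUATION
        before_function_word = next_word in CONJUNCTIONS | PREPOSITIONS
        if after_punctuation or before_function_word:
            preferred.append(i)
    candidates = preferred or allowed or list(range(1, len(words)))
    cut = min(candidates, key=lambda i: abs(i - middle))
    return [" ".join(words[:cut]), " ".join(words[cut:])]


print(split_into_two_lines("She put the book on the table"))
```

The output of a helper like this is only a suggestion; names, titles and the other linguistic units listed above still need a human check.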

Translation

• Edit translations to be as short as possible without changing their meaning.

• Ensure the number of words in the target language does not exceed the number of words in the source language by 3 words. The number shown under each text-tab/card is the word count for that tab, and its colour indicates whether this limit is met.

  • If the number is green, the number of words in the target language is the same as, or close enough to, the number of words in the source language. This leaves sufficient time for the text in the tab/card to be voiced during the voice over task.
  • If the number is red, the number of words in the target language is significantly higher than the number of words in the source language. During voice over of that tab/card, the speech will lack clarity because there is not enough time to voice the text. When the colour is red, reduce the word count of the target language to match that of the source language. A lower count is fine.
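The colour rule can be approximated as follows. This is a sketch, not Chitralekha's actual logic: the function name is hypothetical, the threshold of 3 comes from the guideline above, and treating an excess of more than 3 words as red (as opposed to 3 or more) is an assumption.

```python
def word_count_colour(source_text, target_text, tolerance=3):
    """Return (target_word_count, colour) for one text-tab/card.

    Assumption: the card turns red when the target has more than
    `tolerance` extra words compared with the source.
    """
    source_count = len(source_text.split())
    target_count = len(target_text.split())
    excess = target_count - source_count
    colour = "red" if excess > tolerance else "green"
    return target_count, colour


print(word_count_colour("This is the source text", "This is a much longer translated version of it"))
```

A target that is shorter than the source gives a negative excess and therefore stays green, which matches the note that a lower count is fine.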

Voice Over

Machine Generated Voice Over

• Listen to the generated voice for each card before completing the task to confirm that all cards are accurate and of high quality.

• If a card lacks clarity when played, try rewording the sentence so that the words can be spoken within the given duration. In most cases, speech lacks clarity because the card contains more words than can be voiced in the set duration.

• Note that the duration of each card cannot be edited in Voice Over. This has been set to match the overall duration of the video.

• If a card contains numbers or decimals, the voice may fail to generate. In such cases, rewrite the number or decimal in words and regenerate the voice.
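Cards that contain numerals can be located before generating voice, so that they can be rewritten in words by hand. The sketch below is a hypothetical helper, not a Chitralekha feature; it only finds the numerals and leaves the rewording to the editor, since the spelled-out form depends on the target language.

```python
import re

# Matches integers and decimals such as "42", "3.14" or "1,000".
# In Python 3, \d also matches digits from other scripts (e.g. Devanagari).
NUMERAL_PATTERN = re.compile(r"\d+(?:[.,]\d+)*")


def cards_with_numerals(cards):
    """Return (index, numerals) for every card that contains digits."""
    flagged = []
    for index, text in enumerate(cards):
        numerals = NUMERAL_PATTERN.findall(text)
        if numerals:
            flagged.append((index, numerals))
    return flagged


cards = ["The value is 3.14 today", "No digits here", "Room 42"]
print(cards_with_numerals(cards))
```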

Manual Voice Over

• Ensure that recordings are done in a quiet room with no background noise.

• Ensure the mic used for recording the voice is of good quality.

• Pace the recording to match the duration set for the card so that there are no silences and no loss of clarity.

References

  1. https://partnerhelp.netflixstudios.com/hc/en-us/articles/215758617-Timed-Text-Style-Guide-General-Requirements
  2. https://www.ted.com/participate/translate/subtitling-tips
  3. https://engagemedia.org/help/best-practices-for-online-subtitling/