Shoonya FAQ - AI4Bharat/Shoonya GitHub Wiki

1. What is Shoonya? Shoonya is an open-source platform for annotating and labeling data at scale, built with a vision to enhance the digital presence of under-represented languages in India.


2. What is the purpose of Shoonya? Data collection and curation is usually at the core of building state-of-the-art ML models for NLP. This is where Shoonya comes into the picture: it provides a platform for annotators and translators to create the large, high-quality datasets these models need.


3. What is NLTM? The National Language Translation Mission (NLTM) was launched to enable online services in local Indian languages. It aims to make the wealth of governance- and policy-related knowledge on the Internet available in major Indian languages. The Ministry of Electronics and Information Technology (MeitY) has launched 'Bhashini' to help ensure that digital content is readily available to all citizens in their preferred languages.


4. Where does Shoonya fit in the overall picture?


5. What are the currently supported project types in Shoonya? The following project types are supported:

  • Sentence Verification
  • Sentence Translation
  • Semantic Translation Rating
  • Conversation Translation
  • Speech Transcription