Literature Review - CankayaUniversity/ceng-407-408-2024-2025-EmoTagger-A-Community-driven-Emotion-Annotation-Tool GitHub Wiki
Literature Review
Abstract
Music, serving as a universal means of communication, has significantly impacted human history by profoundly connecting with people's emotional experiences. As technology progresses, digital music platforms have changed how individuals engage with music, providing a chance to discover the emotional bonds they form with tracks. Currently, methods for examining music's effects on people's emotions have become significant, particularly in studies focused on enhancing the understanding of listener psychology and preferences.
Our project involves the development of a music survey application that gathers feedback from users about the songs they listen to, aiming to capture the emotional effects these songs evoke. The survey results reveal that users respond to music not only with primary emotions like happiness, sadness, or energy, but with a broader spectrum of emotions shaped by personal experiences. This highlights the multidimensional impact of music on individuals and underscores the need to analyze diverse emotional responses among users.
The next phase of the project involves developing a website to make these survey-based analyses accessible to a wider audience and to enrich user experience. Through these platforms, individuals will be able to experience music from various emotional perspectives and share their musical experiences within a community. This study aims to deepen the understanding of the connection between musical preferences and emotional responses, offering new insights into the psychological impact of music based on extensive data.
1.Crowdsourcing and Data Annotation in Music Labeling:
Crowdsourcing is an innovative method for harnessing the collective power of diverse groups worldwide through internet platforms. It enables people to participate by decomposing difficult tasks into smaller, more manageable components. It enables people to effectively contribute data, opinions, and information by decomposing difficult tasks into smaller, more manageable components. "Crowdsourcing involves seeking knowledge, goods, or services from a large body of people."[1] This approach produces richer, more responsible data and is consistent with the "wisdom of the crowd" theory. Crowdsourcing's cost-effectiveness, speed, and diversity make it increasingly valuable forlarge-scale data collection, giving organizations affordable access to a wide range of expertise. Most crowdsourcing projects revolve around four core concepts: gaining wisdom or knowledge through problem solving; creating new content, such as through wikis voting on policies or people; and crowdfunding, which involves raising money for various purposes, usually by collecting small amounts from a large pool of people.[2]
By allowing individuals from various cultural and personal backgrounds to tag music according to their emotional perceptions, crowdsourcing is crucial to our project's goal of creating a rich and varied emotional dataset. “By improving the dataset, learning applications can become more effective and personalized. Using a diverse emotional dataset improves the accuracy and efficiency of machine learning and AI applications. This allows us to better analyze different emotional interpretations.”[3] We hope to achieve a wide range of emotional interpretations by requesting contributions from a diverse user base, reflecting the ways in which different audiences perceive music. Crowdsourced data labeling harnesses the power of the crowd to expedite the process. Crowdsourcing platforms engage a large-scale diverse pool of human annotators from around the world, each contributing their skills and knowledge. [4]The dataset will be enhanced by this inclusivity, increasing its value for machine learning applications that need a variety of emotional data. Crowdsourcing platforms engage a large-scale diverse pool of human annotators from around the world, each contributing their skills and knowledge. A greater diversity of labelers can offset the potential for unconscious biases of any particular types of people to influence datasets.[5]
2.How Crowdsourcing Works for Music Labeling
In music, data tagging is a process used for this purpose, which allows machine learning models to learn musical features. Different types of tagging, such as note recognition, chord detection, tempo analysis, and even genre classification, enable for deeper analysis of music data.[6] Crowdsourcing tagging often uses methods such as voting or surveys, allowing users to efficiently categorize content. As part of our project’s crowdsourcing implementation, users will be asked to tag each track with the emotion they believe best expresses it. Like community-based platforms such as Last.fm, this tagging method utilizes personal input from a vast audience to create a strong dataset that reflects nuanced differences while showcasing shared feelings.The quality of data is a crucial element that directly influences tagging outcomes. Improper or insufficient tagging can adversely affect the effectiveness of machine learning models.
3.Data Annotation in Music
Data annotation, specifically for audio and music, involves categorizing and tagging audio files manually based on predefined criteria. Users who assign emotions to musical tracks will provide crowdsourced input for our project's emotional labeling. This annotated data then becomes training material for AI systems, helping them recognize and classify emotions in music more accurately.This method bridges the gap between objective machine learning and subjective human experiences by utilizing crowdsourcing for AI development.
4.Music and Emotion Analysis
Recent studies have highlighted that music evokes a range of emotional states (e.g., joy, sadness, anxiety), and cultural factors influence how listeners from different backgrounds perceive these emotions. For instance, both Chinese and U.S. listeners may recognize a piece as joyful or angry, though cultural nuances affect whether they view these emotions favorably or unfavorably. For instance, applications can analyze user preferences and emotional responses to music, tailoring playlists or recommending new songs that align with the user's current emotional state. [7] Researchers can quickly and cheaply gather this subjective data using crowdsourced platforms like Amazon Mechanical Turk[8], which is a great resource for creating algorithms that can suggest music based on users' moods. As this field continues to evolve, it promises to bridge the gap between human emotions and technological advancements, leading to more intuitive and empathetic software solutions [8].
5.Practical Applications and Benefits of Crowdsourcing for Our Project
Crowdsourcing is essential to our project's creation of an extensive, culturally diverse dataset for the classification of emotional music. This variation in emotional tagging supports the development of music platform recommendation systems, facilitating customized, mood-oriented music suggestions. The importance of crowdsourcing in user-driven data gathering is additionally illustrated by its support of psychological uses, including the therapeutic application of music to evoke or manage emotions. Through using the capabilities of crowdsourcing, we can collect an extensive amount of data from people with varied cultural backgrounds and musical tastes. This varied dataset will enable us to recognize nuanced emotional reactions to music, surpassing general classifications such as "happy" or "sad." We can dive into particular emotions such as nostalgia, anger, or calmness, offering a deeper comprehension of music's emotional influence. Additionally, crowdsourcing can assist us in revealing cultural variations in music perception. Various cultures might link specific musical elements to distinct emotions. Examining the feedback of individuals from diverse cultural backgrounds allows us to understand these cultural nuances and create more precise and culturally sensitive music recommendation systems.
6.Gamification for Enhanced Engagement
To keep users motivated, our platform incorporates gamification elements such as points, badges, levels, challenges, and leaderboards. These features encourage users to contribute more labels,fostering a sense of achievement and community participation. This combination of crowdsourcing and gamification aims to maximize label accuracy, user engagement, and dataset volume. By providing immediate feedback and recognition, gamification elements incentivize users to actively participate and contribute to the project. This creates a positive cycle where users are motivated to label more music, leading to a higher-quality dataset and a more engaging user experience.
7.Conclusion
Crowdsourcing in music annotation allows us to efficiently gather emotional labels from a diverse audience, yielding a data-rich foundation for training AI systems to recognize and categorize music by mood. The gamified approach incentivizes contributions, enhancing both the volume and quality of data collected. Ultimately, our project aims to advance the field of emotional music analysis, benefiting AI research, music recommendation algorithms, and therapeutic music applications.
References:
[1] https://www.braineet.com/blog/crowdsourcing
[2] https://www.britannica.com/money/crowdsourcing
[3] https://link.springer.com/article/10.1007/s10462-024-10831-1
[4][5] https://crowdsourcingweek.com/blog/crowdsourced-data-labeling/
[6] https://tr.shaip.com/blog/training-data-for-music-ml-models/
[7] https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0297712