Chatfish - cmubuild18/Build18 GitHub Wiki

Chatfish

If you cannot make friends during 0 week or have goldfish memory when it come to putting names to faces, use Chatfish to make ends meet.

Thumbnail Image

"Chatfish" is an innovative project for new students, using a Raspberry Pi 4 Model B, camera, and microphone for facial recognition and interaction. It aims to ease social awkwardness by remembering faces and names, and facilitating engaging conversations in the initial school week. Key concepts include facial recognition, text-to-speech, speech-to-text, text-to-text, and language models. Our team efficiently integrated these concepts during the build week, optimizing algorithms and fine-tuning systems for Chatfish to generate engaging responses.

Behind Chatfish�s conversation engine are three components: a local speech recognition model, a local instruction-tuned LLM, and a text-to-speech API. We use whisper.cpp, an efficient port of OpenAI�s whisper model. We use a 4-bit quantized C++ port of Meta�s llama2-7b-chat model to generate text on-device and the converted gguf format weights and code from the llama-cpp project, which is built on the efficient ggml framework - this allows us to run full inference of a large transformer model with only 8GB of RAM. To convert text into speech, we use the ElevenLabs API.

The project achieved its goals:

  • Facial Recognition: Chatfish recognizes known and unknown faces.
  • Interactive Conversations: It responds effectively, facilitating smooth interactions.
  • Engagement: Chatfish's ability to speak and listen reduces intimidation.
  • Personalization: Remembering faces and names creates a welcoming experience.

Team Members

Team Member Photo
Gina Seo Team Member Photo
Gio Team Member Photo
Justin Team Member Photo
Dylan Team Member Photo
Aaron Team Member Photo

Photos

https://drive.google.com/drive/folders/1ypGFgv1UAcA2FQH4_R3uQQrDwWPXQ0h2?usp=sharing

Parts

Part Unit Price Quantity Cost Needs to be Orderedthrough Build18? (Y/N) Part Link
10 Pcs Servo Motor 18.77 1 18.77 Y https://www.amazon.com/gp/product/B07L2SF3R4/ref=ox_sc_act_title_1?smid=AMIHZKLK542FQ&th=1
Camera module 19.99 1 19.99 Y https://www.amazon.com/gp/product/B01ER2SKFS/ref=ox_sc_act_title_3?smid=AR79UU9IO6XWC&psc=1
Raspberry Pi Zero 24.99 1 24.99 Y https://www.amazon.com/gp/product/B07C7FHJDX/ref=ox_sc_act_title_4?smid=A2QE71HEBJRNZE&psc=1
Mountable Fish 58.07 1 58.07 Y https://www.amazon.com/Gemmy-Inflateables-Holiday-G08-47957/dp/B00OW8GLRM/ref=sr_1_2?crid=3CKCFP8KFHLG5&keywords=talking+fish&qid=1699829087&sprefix=talking+fis%2Caps%2C93&sr=8-2&ufe=app_do%3Aamzn1.fos.006c50ae-5d4c-4777-9bc0-4513d670b6bc
Speaker $7.68 2 $15.36 Y https://www.amazon.com/Gikfun-Speaker-Stereo-Loudspeaker-Arduino/dp/B01LN8ONG4/ref=pd_bxgy_img_sccl_1/144-6717060-1012851?pd_rd_w=KnINm&content-id=amzn1.sym.7746dde5-5539-43d2-b75f-28935d70f100&pf_rd_p=7746dde5-5539-43d2-b75f-28935d70f100&pf_rd_r=8KNFS07Y3BQ7QHVJYFZN&pd_rd_wg=Lg7ta&pd_rd_r=49f76eb0-4fab-4c7d-ac5c-d436fd95fad9&pd_rd_i=B01LN8ONG4&psc=1
Microphone 18.39 2 36.78 Y https://www.amazon.com/gp/product/B074BLM973/ref=ox_sc_act_title_2?smid=A3RVC0VS3R6WKL&psc=1
Fish head 22.09 1 22.09 Y https://a.co/d/1KHPQfZ
SanDisk 128GB Extreme microSDXC UHS-I Memory Card with Adapter - Up $19.99 1 $19.99 N
CanaKit 3.5A Raspberry Pi 4 Power Supply (USB-C) $9.99 1 $9.99 N
Raspberry Pi 4 Computer Model B 8GB Single Board Computer Suitable $79.90 1 $79.90 N https://www.amazon.com/Raspberry-Pi-Computer-Suitable-Workstation/dp/B0899VXM8F/ref=psdc_193870011_t1_B07C7FHJDX?th=1
Total Cost: 305.93
Remaining Budget:(out of $300) -5.93
`