Home - ua-datalab/AI-for-Professionals GitHub Wiki

AI Toolkit!

AI Enhanced Tools

mindmap
  ((**AI Toolkit**))
    id(**Code Development**)
      Visual Studio Code
      Jupyter Notebooks
    id(**Data Analysis Platforms**)
      KNIME
      OpenRefine
      Orange
    id(**Machine Learning <br/> Deep Learning**)
      Scikit-Learn
      PyTorch
      TensorFlow
    id(**Natural Language Processing <br/> NLP**)
      SpaCy
      NLTK
    id(**Geospatial Analysis**)
      QGIS
    id(**Databases**)
      DuckDB
    id(**Data Visualization**)
      Data-to-Viz
      Google Looker Studio 
      PowerBI
      Plotly
      Shiny
      Tableau
    id(**Generating Ideas**)
      ChatGPT
      Claude
      Open Source LLMs
         UArizona AI Verde
      Gemini
      Google Notebook
      Perplexity AI
    id(**Collaborative Research <br/> #38; Information Gathering**)
      Elicit
      Research Rabbit
      SciSpace
      Scite
      Semantic Search
    id(**Project Documentation**)
      GitHub Pages
      Google Docs
      Notion
    id(**Brainstorming <br/> #38; Mind Mapping**)
      NotebookLM
      Miro
      MindMeister

Collaborative Research & Information Gathering

  • Connected Papers. Connected Papers is a web-based AI tool that helps researchers explore and discover academic papers in their field of interest.
  • Elicit. Elicit AI is a free tool developed by Ought that helps researchers with various aspects of the research process, particularly literature reviews.
  • Perplexity. Perplexity AI is an artificial intelligence-powered search engine that aims to provide users with comprehensive and accurate answers to their questions.
  • Research Rabbit. ResearchRabbit is a free, AI-powered online platform that helps researchers map and explore the literature in their field.
  • SciSpace. SciSpace is an AI-powered research platform designed to help academics efficiently navigate scholarly literature.
  • Scite. Scite is an AI-powered research platform that helps users understand and evaluate research articles by providing context and classification for citations.

Generating ideas

  • OpenAI ChatGPT. ChatGPT is a large language model chatbot created by OpenAI that can engage in human-like conversations and generate text based on various prompts. OpenAI models from the same family also power Microsoft Copilot.
  • Gemini. Google Gemini is a large language model (LLM) and multimodal AI assistant that can be accessed through a chatbot interface.
  • Google NotebookLM. NotebookLM is an online research and note-taking tool developed by Google Labs that uses artificial intelligence (AI), specifically Google Gemini, to help users interact with their documents.
  • Google AI Studio. Google AI Studio is a free, browser-based Integrated Development Environment (IDE) that allows users to experiment with and prototype applications using Google's Gemini family of generative AI models.
  • Claude. Claude AI is a large language model (LLM) and AI chatbot developed by Anthropic that excels at natural language processing (NLP).
  • U of Arizona AI Verde. Local LLMs.
  • Chatbox. Chatbox software is a user interface, typically a pop-up window or widget on a website or application, that facilitates communication between a user and either a live agent (human) or a chatbot (AI-powered). Requires an API.

General Reference: Generative AI & Prompt Engineering


Specific applications

Code development

Data analysis platforms

  • KNIME. KNIME (Konstanz Information Miner) is a free and open-source data analytics platform that allows users to build data science workflows without extensive coding skills.
  • OpenRefine. OpenRefine is a free, open-source software tool that cleans, transforms, and enriches data, especially when dealing with messy or incomplete datasets.
  • Orange Data Mining. Orange is a visual programming toolkit that facilitates data visualization, machine learning, and data analysis.

Databases

Datasets

Healthcare

Data Visualization

  • Data-to-Viz.com. From Data to Viz leads you to the most appropriate graph for your data. It links to the code to build it and lists common caveats you should avoid.
  • Exploratory. Exploratory’s Simple UI experience makes it possible for anyone to use Data Science to explore data quickly, discover deeper insights, and communicate effectively.
  • Tableau. Tableau is a visual analytics platform and business intelligence (BI) software that helps users visualize, analyze, and share data.
  • PowerBI. Power BI is a suite of business analytics services and software from Microsoft designed to help users visualize and analyze data to gain insights and make informed decisions.
  • Shiny. A Shiny app is an interactive web application built using the Shiny framework, a package for the R programming language (a Python version is also available).
  • plotly. Plotly provides online graphing, analytics, and statistics tools for individuals and collaboration, as well as scientific graphing libraries for Python, R, MATLAB, Julia, and others.
  • Google Looker Studio. Looker Studio is a free, web-based data visualization and reporting tool from Google Cloud that allows users to create interactive dashboards and reports from various data sources.

Web development oriented

  • Gradio. Gradio is a Python library that simplifies building interactive web applications, particularly for machine learning demos and applications.
  • Streamlit. Streamlit is an open-source Python library that makes it easy to build and share interactive, data-rich web apps.

Geospatial applications

  • QGIS. QGIS (formerly Quantum GIS) is a free and open-source Geographic Information System (GIS) software that allows users to create, analyze, and manage spatial data.

Machine Learning / Deep Learning

  • Scikit-Learn. Scikit-learn is a free and open-source machine learning library for the [Python](https://en.wikipedia.org/wiki/Python_(programming_language)) programming language.
  • PyTorch. PyTorch is an open-source machine learning framework based on the Torch library, primarily developed by Meta AI. It is used for applications such as computer vision and natural language processing.
  • TensorFlow. TensorFlow is a software library for machine learning and artificial intelligence. It can be used across a range of tasks, but is used mainly for training and inference of neural networks. It is one of the most popular deep learning frameworks, alongside others such as PyTorch.
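
To give a feel for what working with one of these libraries looks like, here is a minimal sketch using Scikit-Learn. It is only an illustration, not part of the original toolkit: the choice of the built-in iris dataset, the logistic regression model, and the helper name `train_iris_classifier` are all assumptions made for this example.

```python
# Minimal sketch: train and evaluate a classifier with scikit-learn.
# The dataset, the model, and the function name are illustrative assumptions.
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split


def train_iris_classifier(test_size: float = 0.25, seed: int = 0) -> float:
    """Fit logistic regression on the iris data and return held-out accuracy."""
    # Load the features (X) and class labels (y) of the built-in iris dataset.
    X, y = load_iris(return_X_y=True)

    # Hold out part of the data so the model is scored on unseen samples.
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=test_size, random_state=seed, stratify=y
    )

    # Fit the model on the training split only.
    model = LogisticRegression(max_iter=1000)
    model.fit(X_train, y_train)

    # Compare predictions on the held-out split with the true labels.
    return accuracy_score(y_test, model.predict(X_test))


if __name__ == "__main__":
    print(f"Held-out accuracy: {train_iris_classifier():.2f}")
```

The same load, split, fit, and score pattern carries over to most Scikit-Learn estimators. PyTorch and TensorFlow follow a different workflow built around tensors and explicit training loops.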

Natural Language Processing


Created: 04/29/2025 (C. Lizárraga)

Updated: 05/17/2025 (C. Lizárraga)

DataLab, Data Science Institute, University of Arizona.

CC BY-NC-SA 4.0
