Home - ua-datalab/AI-for-Professionals GitHub Wiki
mindmap
((**AI Toolkit**))
id(**Code Development**)
Visual Studio Code
Jupyter Notebooks
id(**Data Analysis Platforms**)
KNIME
OpenRefine
Orange
id(**Machine Learning <br/> Deep Learning**)
Scikit-Learn
PyTorch
Tensorflow
id(**Natural Language Processing <br/> NLP**)
SpaCy
NLTK
id(**Geospatial Analysis**)
QGIS
id(**Databases**)
Duckdb
id(**Data Visualization**)
Data-to-Viz
Google Looker Studio
PowerBI
Plotly
Shiny
Tableau
id(**Generating Ideas**)
ChatGPT
Claude
Open Source LLMs
UArizona AI Verde
Gemini
Google Notebook
Perplexity AI
id(**Collaborative Research <br/> #38; Information Gathering**)
Elicit
Research Rabbit
SciSpace
Scite
Semantic Search
id(**Project Documentation**)
GitHub Pages
Google Docs
Notion
id(**Brainstorming <br/> #38; Mind Mapping**)
NotebookLM
Miro
MindMeister
- Connected Papers. Connected papers, a web-based AI tool that helps researchers explore and discover academic papers in their field of interest.
- Elicit. Elicit AI is a free tool developed by Ought that helps researchers with various aspects of the research process, particularly literature reviews.
- Perplexity. Perplexity AI is an artificial intelligence-powered search engine that aims to provide users with comprehensive and accurate answers to their questions. It
- Research Rabbit. ResearchRabbit is a free, AI-powered online platform that helps researchers map and explore the literature in their field.
- SciSpace. SciSpace is an AI-powered research platform designed to help academics efficiently navigate scholarly literature.
- Scite. Scite is an AI-powered research platform that helps users understand and evaluate research articles by providing context and classification for citations.
- Open AI ChatGPT. ChatGPT is a large language model chatbot created by OpenAI that can engage in human-like conversations and generate text based on various prompts. It powers Microsoft Copilot.
- Gemini. Google Gemini is a large language model (LLM) and multimodal AI assistant that can be accessed through a chatbot interface.
- Google NotebookLM. NotebookLM (Google NotebookLM) is a research and note-taking online tool developed by Google Labs that uses artificial intelligence (AI), specifically Google Gemini, to assist users in interacting with their documents.
- Google AI Studio. Google AI Studio is a free, browser-based Integrated Development Environment (IDE) that allows users to experiment with and prototype applications using Google's Gemini family of generative AI models.
- Claude. Claude AI is a large language model (LLM) and AI chatbot developed by Anthropic that excels at natural language processing (NLP).
- U of Arizona AI Verde. Local LLMs.
- Chatbox. Chatbox software is a user interface, typically a pop-up window or widget on a website or application, that facilitates communication between a user and either a live agent (human) or a chatbot (AI-powered). Requires an API.
General Reference: Generative AI & Prompt Engineering
- Visual Studio Code (VS Code). Visual Studio Code is a free, cross-platform code editor developed by Microsoft.
- Jupyter Notebooks. A Jupyter Notebook is a web-based interactive computing environment that allows users to create and share documents containing live code, equations, visualizations, and narrative text.
- KNIME. KNIME (Konstanz Information Miner) is a free and open-source data analytics platform that allows users to build data science workflows without extensive coding skills.
- OpenRefine. OpenRefine is a free, open-source software tool that cleans, transforms, and enriches data, especially when dealing with messy or incomplete datasets.
- Orange Data Mining. Orange is a visual programming toolkit that facilitates data visualization, machine learning, and data analysis.
- DuckDB. DuckDB is a high-performance, embedded, in-process, OLAP (Online Analytical Processing) relational database management system (RDBMS) that is designed for data analysis.
- Data.gov
- Dataset Search Google
- Kaggle Datasets
- Papers with code Datasets
- University of California at Irvine
- Data-to-Viz.com. From Data to Viz leads you to the most appropriate graph for your data. It links to the code to build it and lists common caveats you should avoid.
- Exploratory. Exploratory’s Simple UI experience makes it possible for anyone to use Data Science to explore data quickly, discover deeper insights, and communicate effectively.
- Tableau. Tableau is a visual analytics platform and business intelligence (BI) software that helps users visualize, analyze, and share data.
- PowerBI. Power BI is a suite of business analytics services and software from Microsoft designed to help users visualize and analyze data to gain insights and make informed decisions.
- Shiny. A Shiny app is an interactive web application built using the Shiny framework, which is part of the R programming language.
- plotly. Plotly provides online graphing, analytics, and statistics tools for individuals and collaboration, as well as scientific graphing libraries for Python, R, MATLAB, Julia, and others.
- Google Looker Studio. Looker Studio is a free, web-based data visualization and reporting tool from Google Cloud that allows users to create interactive dashboards and reports from various data sources.
- Gradio. Gradio is a Python library that simplifies building interactive web applications, particularly for machine learning demos and applications.
- Streamlit. Streamlit is an open-source Python library that makes it easy to build and share interactive, data-rich web apps.
- QGIS. QGIS (formerly Quantum GIS) is a free and open-source Geographic Information System (GIS) software that allows users to create, analyze, and manage spatial data.
- Scikit-Learn. Scikit-learn is a free and open-source machine learning library for the [Python](https://en.wikipedia.org/wiki/Python_(programming_language) programming language.
- PyTorch. PyTorch is an open-source machine learning framework based on the Torch library, primarily developed by Meta AI. It is used for applications such as computer vision and natural language processing.
- Tensorflow. TensorFlow is a software library for machine learning and artificial intelligence. It can be used across a range of tasks, but is used mainly for training and inference of neural networks. It is one of the most popular deep learning frameworks, alongside others such as PyTorch.
- spaCy. spaCy is a free, open-source library for advanced Natural Language Processing (NLP) in Python.
- NLTK. NLTK (Natural Language Toolkit) is a leading Python library for working with human language data.
Created: 04/29/2025 (C. Lizárraga)
Updated: 05/17/2025 (C. Lizárraga)
DataLab, Data Science Institute, University of Arizona.