GSoC 2019 Ideas - nteract/nteract GitHub Wiki

GSoC 2019 Project Ideas

1. Improve Data Explorer Summary statistics

The nteract Data Explorer is a React component that provides rich rendering of pandas data frames within nteract applications. This closed pull request outlines some of the changes needed to add summary statistics to the Data Explorer.

A screenshot of the Data Explorer
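
To give a rough idea of what "summary statistics" could mean here, the following is a minimal sketch in plain Python. It is not taken from the pull request: the `summarize` helper, its output shape, and the sample rows are all hypothetical, chosen to resemble the kind of per-column numbers that pandas' `DataFrame.describe()` reports.

```python
import statistics


def summarize(records):
    """Return per-column summary statistics for a list of row dicts.

    Hypothetical helper for illustration only; not part of nteract.
    """
    # Gather the values of each column across all rows.
    columns = {}
    for row in records:
        for name, value in row.items():
            columns.setdefault(name, []).append(value)

    summary = {}
    for name, values in columns.items():
        # Treat None as a missing value and leave it out of every statistic.
        present = [v for v in values if v is not None]
        numeric = [
            v for v in present
            if isinstance(v, (int, float)) and not isinstance(v, bool)
        ]
        stats = {"count": len(present)}
        if present and len(numeric) == len(present):
            # Fully numeric column: report range and mean.
            stats.update(
                min=min(numeric),
                max=max(numeric),
                mean=statistics.mean(numeric),
            )
        else:
            # Other columns: report the number of distinct values
            # (assumes the values are hashable).
            stats["unique"] = len(set(present))
        summary[name] = stats
    return summary


rows = [
    {"city": "Oslo", "temp": 4.0},
    {"city": "Lima", "temp": 22.0},
    {"city": "Oslo", "temp": None},
]
result = summarize(rows)
print(result)
```

A real implementation would more likely compute these values on the Python side with pandas and pass them to the component alongside the data; how that hand-off should work is part of what the project needs to decide.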

Motivation

Resolving this will allow us to provide useful descriptions of the data frames that are rendered within the Data Explorer.

Expected Skills

You should be familiar with Python.

Available Mentors

  • Safia Abdalla (@captainsafia)

2. Improved Spark Integration

Spark is a popular tool for processing large amounts of data. The Spark Monitor is a tool that allows users to view the progress of Spark jobs. We need your help to integrate the Spark Monitor with nteract, Jupyter, and Spark.

Spark Monitor

Motivation

Completing this feature would improve the user experience of working with Spark in nteract.

Expected Skills

You should be familiar with Spark, JavaScript, and Python.

Available Mentors

  • Kyle Kelley (@rgbkrk)

3. Improve Unit and Integration Testing

Test coverage for the nteract project is currently at 61%, and work on our integration test suite has stalled. We need your help to improve our testing story!

Motivation

An increase in test coverage and more integration tests would help improve code quality and speed up our verify-and-release process.

Expected Skills

You should be familiar with JavaScript; familiarity with TypeScript is a bonus. You should also understand the concepts of unit and integration testing.

Available Mentors

  • Safia Abdalla (@captainsafia)