Luke Denoncourt's Work Report - spatial-data-discovery/sdd-2021 GitHub Wiki

Work Journal



Date: 9/6/2021

Summary:

  • Because I work on two computers, I made sure that Atom, Pandoc, and Git are installed on both.

In the future:

  • Continue setting up pages on GitHub and familiarize myself with the repository.

Date: 9/8/2021

Summary:

  • I put the file sb1-LukeD77 in the sandbox1 folder on the master branch.
  • I created the about-LukeD77 page for the about the coder project; it is in the main spatial data repository.

In the future:

  • Think about the question I want to ask for my project or what data I want to work with

Question of the week:

What is spatial?

Spatial is a facet of an object that includes some sort of area.


Date: 9/13/2021

Summary:

  • I put the file resnet50_flowering_or_not.py in the scripts folder of the master branch of the main page website. It took a lot of tinkering to make it user-friendly on the command line, which I did by adding input prompts for changing model parameters.
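A minimal sketch of what command-line input prompts for model parameters could look like. The parameter names (`epochs`, `learning_rate`), their defaults, and the helper names `ask` and `collect_params` are hypothetical and are not taken from resnet50_flowering_or_not.py.

```python
def ask(prompt, default, cast, input_fn=input):
    """Prompt for a value; fall back to the default on empty or invalid input."""
    raw = input_fn(f"{prompt} [{default}]: ").strip()
    if not raw:
        return default
    try:
        return cast(raw)
    except ValueError:
        print(f"Could not read {raw!r}; using {default}.")
        return default


def collect_params(input_fn=input):
    """Ask for each (hypothetical) model parameter in turn."""
    return {
        "epochs": ask("Number of epochs", 10, int, input_fn),
        "learning_rate": ask("Learning rate", 0.001, float, input_fn),
    }
```

Calling `collect_params()` in a script would show each prompt with its default in brackets, so pressing Enter keeps the default value.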

In the future:

  • Keep thinking about the question I want to ask for my project or what data I want to work with.
  • Put model data in some repository that can be accessed by people who want to use my code.

Date: 9/15/2021

Summary:

  • I placed the sample data for resnet50_flowering_or_not.py in my Google Drive and made the link shareable with everyone. To keep it simple, the script prints the link and then waits for input once the user has downloaded the data. I tried many tools, such as wget, gdrive, and gpower, but none of them worked at all or worked easily, so this was the simplest option I could offer to others.
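The print-the-link-then-wait approach could be sketched roughly as follows. The URL is a placeholder (the real shared link is not given here), and the function name `wait_for_data`, the folder check, and the retry limit are assumptions added for illustration.

```python
from pathlib import Path

# Placeholder link; the real shared Google Drive URL is not given here.
DATA_URL = "https://example.com/sample-data"


def wait_for_data(data_dir, url=DATA_URL, input_fn=input, max_tries=3):
    """Print the download link and wait until the data folder has files in it."""
    data_dir = Path(data_dir)
    for _ in range(max_tries):
        if data_dir.is_dir() and any(data_dir.iterdir()):
            return True
        print(f"Download the sample data from {url}")
        print(f"and unpack it into {data_dir}")
        input_fn("Press Enter once the download has finished: ")
    # Final check after the last prompt.
    return data_dir.is_dir() and any(data_dir.iterdir())
```

Checking the folder after each prompt is one way to catch the case where the user presses Enter before the download has actually finished.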

In the future:

  • Keep thinking about the question I want to ask for my project or what data I want to work with.

Question of the week:

What is utility?

Utility means serving some sort of use or function.


Date: 9/22/2021

Summary:

  • I replaced the resnet50 script with a dad joke emailer for the utility script assignment to keep it simple and to better showcase the emailing code. I am also working on sandbox2, focusing on how to plot the coordinates and on which map; then I will look into doing that for all images.

In the future:

  • Keep thinking about the question I want to ask for my project or what data I want to work with.

Question of the week:

What is spatial data?

Spatial data is any data that is plotted on a graph or that has a location attribute contributing some sort of information.


Date: 9/29/2021

Summary:

  • I downloaded QGIS 3.16 on both my laptops.

In the future:

  • Keep thinking about the question I want to ask for my project or what data I want to work with.

Question of the week:

What is space/place?

I think a place is something of significance or recognition. If it is named, it is a place. Space is what is enclosed within a place and surrounds a place, so everything that is not the idea of the place.


Date: 10/6/2021

Summary:

  • Worked on installing QGIS on both my laptops. They can be accessed from the desktop. Both also have a shapefile with world boundaries.

In the future:

  • Continue thinking about the project. Perhaps visualize changes in political districts after census data is released and how often those districts stay with the same political party afterwards. Basically, visualize gerrymandering.

Question of the week:

How does it look?

Visualizing data requires asking a basic question when generating a graph: how does it look? Data visualization portrays data in a way that conveys information, and if the graph looks disorienting, it does not effectively convey what you are trying to describe.


Date: 10/13/2021

Summary:

  • Worked on the sparse data challenge. We started by looking up methods for gap filling while also visualizing the ASCII files to see what data were missing. We found that all of these methods were implemented in R, so we decided to create our own gap-filling method. After we set up a Google Doc and a GitHub branch, Bryce wrote the code and I answered the discussion questions. All of this is now posted on the main branch.
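As an illustration of what gap filling means, here is a minimal sketch that replaces each missing cell with the mean of its valid neighbors. This is a generic neighbor-mean technique chosen for the example, not necessarily the method we wrote for the challenge; the `-9999.0` missing-value flag and the name `fill_gaps` are assumptions.

```python
NODATA = -9999.0  # assumed missing-value flag; check the raster header for the real one


def fill_gaps(grid, nodata=NODATA):
    """Return a copy of grid with each missing cell set to the mean of its valid neighbors."""
    rows, cols = len(grid), len(grid[0])
    filled = [row[:] for row in grid]
    for r in range(rows):
        for c in range(cols):
            if grid[r][c] != nodata:
                continue
            # Collect the valid values among the up-to-eight surrounding cells.
            neighbors = [
                grid[rr][cc]
                for rr in range(max(r - 1, 0), min(r + 2, rows))
                for cc in range(max(c - 1, 0), min(c + 2, cols))
                if (rr, cc) != (r, c) and grid[rr][cc] != nodata
            ]
            if neighbors:  # leave the cell missing when no neighbor has data
                filled[r][c] = sum(neighbors) / len(neighbors)
    return filled
```

Neighbors are read from the original grid rather than the partly filled copy, so a filled value never feeds into another fill during the same pass.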

In the future:

  • Continue thinking about the project. Perhaps visualize changes in political districts after census data is released and how often those districts stay with the same political party afterwards. Basically, visualize gerrymandering.

Question of the week:

What's the process?

I would not think gap filling is an uncommon process, since there are always clouds somewhere over the Earth and satellite imagery predominates in spatial datasets.


Date: 10/20/2021

Summary:

  • Made a notebook file for analyzing the sparse data challenge files to answer the quiz questions.

In the future:

  • Continue thinking about the project. Perhaps visualize changes in political districts after census data is released and how often those districts stay with the same political party afterwards. Basically, visualize gerrymandering. Need to find those data.

Question of the week:

What is the challenge?

Data science faces numerous problems with the introduction of Big Data. Specifically, we are concerned with volume, variety, velocity, variability, and veracity. We need to change how we store our data, how we access it, and how we analyze it. While computing speed and storage space have increased over time, numerous data science fields are generating orders of magnitude more data and are grappling with unstructured data, which can't easily be analyzed or stored.


Date: 10/27/2021

Summary:

  • I worked with HDF5 files and went through the readings on the topic. I made sure to update the class repository that has the HDF5 read and write files.

In the future:

  • Continue thinking about the project. Perhaps visualize changes in political districts after census data is released and how often those districts stay with the same political party afterwards. Basically, visualize gerrymandering.

Question of the week:

What attributes?

I think important attributes are authorship, general purpose of the repository, and any additional information deemed pertinent based upon the specific project.


Date: 11/4/2021

Summary:

  • I familiarized myself with hdf_read and hdf_write.

In the future:

  • Project data could be census data on race for the state of North Carolina, compared with how districts are redrawn and with the win rate of political parties, to visualize gerrymandering. These data should be available.

Question of the week:

What dimensions?

Dimensions are the factors you are measuring in your data. In spatial data, two dimensions are normally required to plot a location on a map, and three are possible if elevation is also plotted. Recording where something is and then describing each measured value at that location leads to a dataset with an increasing number of dimensions.
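A small sketch of how each added coordinate adds a dimension to a dataset. The temperature values, the grid size, and the helper name `shape` are made up for illustration.

```python
def shape(nested):
    """Return the length along each dimension of a regular, non-empty nested list."""
    dims = []
    while isinstance(nested, list):
        dims.append(len(nested))
        nested = nested[0]
    return tuple(dims)


# Hypothetical temperature readings on a 2 x 3 map grid (rows = y, columns = x).
surface = [[14.1, 14.3, 14.0],
           [13.8, 13.9, 14.2]]

# The same grid measured on two dates adds a time dimension in front.
series = [surface,
          [[15.0, 15.2, 14.9],
           [14.7, 14.8, 15.1]]]
```

Here `shape(surface)` has two entries (y, x) and `shape(series)` has three (time, y, x); the measured value itself sits at the innermost level.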