Chapter 5 problem set 2 - UCD-pbio-rclub/python_problems GitHub Wiki

Julin

Rie

Import the "Brapa_cpm.csv" file as a DataFrame.

(1) Find variance for each gene.

(2) Sort the genes by variance. Find the most variable gene in the file.
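One possible approach to (1) and (2), as a minimal sketch. The layout of "Brapa_cpm.csv" is not shown here, so the code assumes genes are rows and libraries are columns, and it uses a small made-up table (hypothetical gene and library names) in place of the real file.

```python
import pandas as pd

# Hypothetical stand-in for Brapa_cpm.csv: genes as rows, libraries as columns.
# With the real file, something like pd.read_csv("Brapa_cpm.csv", index_col=0)
# may work, depending on how the file is laid out.
cpm = pd.DataFrame(
    {
        "lib1": [1.0, 10.0, 5.0],
        "lib2": [2.0, 30.0, 5.0],
        "lib3": [3.0, 20.0, 5.0],
    },
    index=["geneA", "geneB", "geneC"],
)

# (1) Variance of each gene across libraries (axis=1 works row by row).
gene_var = cpm.var(axis=1)

# (2) Sort from most to least variable; the first label is the most variable gene.
sorted_var = gene_var.sort_values(ascending=False)
most_variable = sorted_var.index[0]

print(sorted_var)
print("Most variable gene:", most_variable)
```

Note that `DataFrame.var` computes the sample variance (ddof=1) by default. If the file turns out to have libraries as rows instead, `axis=0` would be the one to use.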

Min-Yao

Import the data that we used last week as a DataFrame.

(1) In order to know whether there are differences across libraries, please calculate the mean, standard deviation, 25% quantile, 50% quantile, 75% quantile and the maximum value of gene expression levels for each library.

(2) In order to know the general expression pattern of each gene, please calculate the mean, standard deviation, 25% quantile, 50% quantile, 75% quantile and the maximum value of expression levels across libraries for each gene.

(3) Please find the most highly expressed gene in each library.

(4) Please find the library that has the highest expression level in each gene.
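A sketch of how (1) through (4) might be approached with `describe` and `idxmax`. It assumes the same orientation as above (genes as rows, libraries as columns) and uses a small made-up DataFrame; the names `expr`, `per_library`, and `per_gene` are illustrative only.

```python
import pandas as pd

# Hypothetical expression table: genes as rows, libraries as columns.
expr = pd.DataFrame(
    {
        "lib1": [1.0, 10.0, 5.0],
        "lib2": [2.0, 30.0, 4.0],
        "lib3": [9.0, 20.0, 3.0],
    },
    index=["geneA", "geneB", "geneC"],
)

# describe() also reports count and min; keep only the statistics requested.
stats = ["mean", "std", "25%", "50%", "75%", "max"]

# (1) Summary per library: describe() summarizes each column.
per_library = expr.describe().loc[stats]

# (2) Summary per gene: transpose first so that each gene becomes a column.
per_gene = expr.T.describe().loc[stats]

# (3) Most highly expressed gene in each library (row label of each column maximum).
top_gene_per_library = expr.idxmax(axis=0)

# (4) Library with the highest expression for each gene (column label of each row maximum).
top_library_per_gene = expr.idxmax(axis=1)

print(per_library)
print(per_gene)
print(top_gene_per_library)
print(top_library_per_gene)
```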

Ruijuan

Continue with https://github.com/cuttlefishh/python-for-data-analysis/blob/master/assignments/assignment6.md A & B

D1) Convert temp to a DataFrame, then use the apply function to convert temperature_deg_c to temperature_deg_f, and store the result in a new DataFrame called temp1.

D2) Use a list comprehension to achieve the same conversion.

E1) sort the rows in df by sequence_split_libraries values from high to low and store the results as temp3

E2) sort the columns in df by column names from A to Z and store the results as temp4.
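A minimal sketch of D1 through E2. The real `df` comes from the linked assignment and is not reproduced here, so the example builds a small hypothetical one; only the column names `temperature_deg_c` and `sequence_split_libraries` are taken from the problem text. The column `depth_m`, the helper `c_to_f`, and the name `temp2` are made up for illustration.

```python
import pandas as pd

# Hypothetical stand-in for the assignment's df.
df = pd.DataFrame(
    {
        "temperature_deg_c": [0.0, 25.0, 100.0],
        "sequence_split_libraries": [150, 900, 300],
        "depth_m": [5, 10, 15],
    },
    index=["s1", "s2", "s3"],
)


def c_to_f(c):
    """Convert degrees Celsius to degrees Fahrenheit."""
    return c * 9 / 5 + 32


# D1) Turn the temperature Series into a DataFrame, then convert with apply.
temp = df["temperature_deg_c"].to_frame()
temp1 = temp.copy()
temp1["temperature_deg_f"] = temp["temperature_deg_c"].apply(c_to_f)

# D2) The same conversion with a list comprehension.
temp2 = temp.copy()
temp2["temperature_deg_f"] = [c_to_f(c) for c in temp["temperature_deg_c"]]

# E1) Sort rows by sequence_split_libraries, high to low.
temp3 = df.sort_values(by="sequence_split_libraries", ascending=False)

# E2) Sort columns by name, A to Z.
temp4 = df.sort_index(axis=1)

print(temp1)
print(temp3)
print(temp4)
```

Both `sort_values` and `sort_index` return new objects by default, so `df` itself is left unchanged.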

John

Using import pandas_datareader.data as web:

  1. Create a Series from the stock prices of your favorite company

  2. Find the worst trading day for the company

  3. Find the best trading day for the company

  4. Find the average trading price
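A sketch of steps 2 to 4 on a Series of closing prices. Fetching real data with pandas_datareader needs network access and a working data source, so this example uses a short made-up price series instead. It also assumes that "worst" and "best" mean the largest one-day percentage drop and gain in the closing price; other definitions (for example, lowest and highest close) are equally reasonable.

```python
import pandas as pd

# Hypothetical closing prices. With pandas_datareader, a Series like this would
# typically be one price column of the DataFrame returned by web.DataReader(...).
close = pd.Series(
    [100.0, 105.0, 84.0, 100.8, 100.8],
    index=pd.to_datetime(
        ["2018-01-02", "2018-01-03", "2018-01-04", "2018-01-05", "2018-01-08"]
    ),
    name="close",
)

# Day-over-day fractional change; the first entry is NaN and is skipped below.
daily_change = close.pct_change()

# 2. Worst trading day: the date with the largest one-day drop.
worst_day = daily_change.idxmin()

# 3. Best trading day: the date with the largest one-day gain.
best_day = daily_change.idxmax()

# 4. Average trading price over the period.
average_price = close.mean()

print("Worst day:", worst_day.date())
print("Best day:", best_day.date())
print("Average price:", average_price)
```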