meeting_notes.md


Monday, February 13, 2023

Tuesday, February 14, 2023

  • B73 v5 genome (FASTA) (on second thought you don't need this taking up space on your computer). This is the overall site for any B73 v5-related files you may want: https://download.maizegdb.org/Zm-B73-REFERENCE-NAM-5.0/
  • Genome annotation (GFF3): Download "Zm-B73-REFERENCE-NAM-5.0_Zm00001eb.1.gff3.gz" from the above link. It is a compressed file, similar to a ZIP. The compression format is gzip, which is common on the command line, and you uncompress it with the command "gunzip". Open the terminal, navigate to the directory where you put the GFF3 file, and enter "gunzip Zm-B73-REFERENCE-NAM-5.0_Zm00001eb.1.gff3.gz". You navigate through your file system with "cd" (change directory, aka folder): "cd directory_name" moves you into a directory and "cd .." brings you back out of it. "ls" lists the files and folders in the directory you're in. I believe the terminal opens in your home directory, "~", so "cd Downloads" should take you to Downloads, though I'm not too sure about the macOS file system.
  • TODO for Mikhaiya: Look for a BED-format file of binding peak data to compare against the annotated genes.
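
The gunzip step above can also be done from Python, which may be handy later inside Colab. This is only a sketch using the standard library; the function name `gunzip_file` and the `keep_original` flag are made up for illustration.

```python
import gzip
import shutil
from pathlib import Path


def gunzip_file(gz_path, keep_original=True):
    """Decompress a .gz file next to itself and return the new path.

    Hypothetical helper, not part of any library.
    """
    gz_path = Path(gz_path)
    out_path = gz_path.with_suffix("")  # drops only the trailing ".gz"
    with gzip.open(gz_path, "rb") as src, open(out_path, "wb") as dst:
        # Stream the data in chunks so a large annotation file
        # never has to fit in memory at once.
        shutil.copyfileobj(src, dst)
    if not keep_original:
        # Command-line gunzip deletes the .gz by default; mimic that on request.
        gz_path.unlink()
    return out_path
```

For example, `gunzip_file(Path.home() / "Downloads" / "Zm-B73-REFERENCE-NAM-5.0_Zm00001eb.1.gff3.gz")` would write the uncompressed GFF3 into Downloads, assuming that is where the file was saved. `Path.home()` is the Python equivalent of "~".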

Tuesday, February 21, 2023

Tuesday, February 28, 2023

Tuesday, March 7, 2023

  • Meetings only on Tuesdays from now on; next week's is over Zoom.
  • Continue to download TF binding sites from JBrowse. Do not download histone ChIP-seq tracks (e.g. H3K27me3).
  • TODO: Mikhaiya email for bigger BED files, starting with John Portwood. [email protected]
  • Alex hunt down someone at Maize Meeting
  • Getting set up with BioPython
  • Mikhaiya's Colab link: https://colab.research.google.com/drive/14RgYijXGVbR4IARXJ3koEnNoqjEYqGdH
  • Reading the Zm...gff file using a GFF parser compatible with BioPython: https://biopython.org/wiki/GFF_Parsing
  • Python library for data/tables: Pandas (https://pandas.pydata.org/docs/reference/api/pandas.read_csv.html). The top 4 lines of the GFF3 file are not part of the table and just give information about the gene annotation, so I skip them with the parameter "skiprows". GFF3 also has no row of column names, so "header=None" keeps the first record from being read as a header: maize_gene_annotation_table = pandas.read_csv("Zm-B73-REFERENCE-NAM-5.0_Zm00001eb.1.gff3", sep="\t", skiprows=4, header=None)
  • TODO: Mikhaiya set up Github with Colab. Clone the maize_binding_colab repo.
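
A slightly fuller sketch of the pandas step in these notes, in case it helps in the Colab notebook. The nine column names follow the GFF3 format; the helper names `read_gff3` and `genes_only` are made up here, and the default of 4 skipped lines is taken from the note above rather than checked against the file.

```python
import pandas as pd

# The nine tab-separated columns defined by the GFF3 format.
GFF3_COLUMNS = [
    "seqid", "source", "type", "start", "end",
    "score", "strand", "phase", "attributes",
]


def read_gff3(path_or_buffer, n_header_lines=4):
    """Read a GFF3 file into a DataFrame (hypothetical helper).

    n_header_lines is the number of leading metadata lines to skip;
    4 is the value used in these notes.
    """
    return pd.read_csv(
        path_or_buffer,
        sep="\t",
        skiprows=n_header_lines,
        header=None,          # GFF3 has no column-name row
        names=GFF3_COLUMNS,
    )


def genes_only(table):
    """Keep rows whose type is 'gene' and pull the ID out of the attributes."""
    genes = table[table["type"] == "gene"].copy()
    # Attributes look like "ID=...;key=value;...": capture the text after "ID=".
    genes["gene_id"] = genes["attributes"].str.extract(r"ID=([^;]+)", expand=False)
    return genes
```

Calling `genes_only(read_gff3("Zm-B73-REFERENCE-NAM-5.0_Zm00001eb.1.gff3"))` should give one row per annotated gene, which is the table the binding peaks would eventually be compared against.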

Tuesday, March 21, 2023

  • Installed pyBedGraph and uploaded the chromosome sizes file it needs.
  • TODO: Mikhaiya try reading a bedGraph file with this package: https://pypi.org/project/pyBedGraph/
  • Notes on grad school:
  • Doing a research thesis heavily recommended for bioinformatics/bio. It demonstrates independence and many jobs will ask why if you don't do it.
  • Many places will require 3 letters of recommendation. UIUC explicitly said they preferred research supervisors. If you want our lab to write a letter, let's start thinking about it now so we can have some good things to say.
  • I applied fall of 2019 and got into UCSD and UIUC, rejected from UWashington, Cold Spring Harbor, and later Duke. We can talk odds/good schools for your objectives.
  • Graduate fellowship resources (Alex has bank of personal and research statements available): https://www.nsfgrfp.org/
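
For the bedGraph TODO in these notes, it may help to see what the file holds before using the package. The sketch below is plain Python, not pyBedGraph's API. It assumes the usual bedGraph layout of chrom, start, end, value per line, with 0-based half-open coordinates; the function names are made up.

```python
def read_bedgraph(lines):
    """Yield (chrom, start, end, value) tuples from bedGraph text lines."""
    for line in lines:
        line = line.strip()
        # Skip blank lines and the optional track/browser/comment lines.
        if not line or line.startswith(("track", "browser", "#")):
            continue
        chrom, start, end, value = line.split()[:4]
        yield chrom, int(start), int(end), float(value)


def mean_signal(intervals, chrom, start, end):
    """Length-weighted mean signal over [start, end) on one chromosome.

    Bases not covered by any bedGraph interval count as 0.
    """
    total = 0.0
    for c, s, e, v in intervals:
        if c != chrom:
            continue
        overlap = min(e, end) - max(s, start)  # bases shared with the query
        if overlap > 0:
            total += overlap * v
    return total / (end - start)
```

With a real file, `list(read_bedgraph(open("some_track.bedgraph")))` would load every interval (the filename is a placeholder), and `mean_signal` could then score a gene's promoter region. This linear scan is slow for whole-genome tracks, which is the kind of job a dedicated package is for.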

Tuesday, March 28, 2023