Example of use GUI for D. melanogaster eye color genes - aakechin/NGS-PrimerPlex GitHub Wiki
Genome preparations
NCBI Genome database, search for Drosophila melanogaster and click genome after words "Download sequences in FASTA format for".
If you have already prepared it, skip this step. To prepare reference genome, go toSave the archive and unzip it. WARNING! All files that will be used in GUI-version should be placed in folders nearest to C:\ without any non-latin or non-standard (/,",",_,-,<,> etc) symbols. Otherwise, GUI will not work.
Then, go to the bottom of the page and open one by one links to genome chromosomes (NC_004354.4, NT_033779.5, NT_033778.4 etc.).
https://www.ncbi.nlm.nih.gov/nuccore/NC_004354.4 should be opened) click "Send to:" -> File -> Choose "GenBank (full) -> "Create File".
At the top of the opened pages (e.g.Each GenBank-file should be named as this chromosome is called in the reference genome FASTA-file. To know, how is it called in the reference genome file. Open the downloaded reference genome FASTA-file with WordPad and look at first words near ">". Other names can be found by searching for the ">" symbol:
For example, for the D. melanogaster chromosome X GenBank-file can be named as NC_004354.4.gb.
Finally, you should have a directory with the following files:
- GCF_000001215.4_Release_6_plus_ISO1_MT_genomic.fna - FASTA-file with reference genome
- NC_004353.4.gb - chrX
- NT_033779.5.gb - chr2L
- NT_033778.4.gb - chr2R
- NT_037436.4.gb - chr3L
- NC_004353.4.gb - chr4
- NT_033777.3.gb - chr3R
- NC_024512.1.gb - chrY
- NC_024511.2.gb - chrMT
Extraction of genome regions
We chose four genes associated with eye color mutations: chocolate (VhaAC39-1), maroon (Vps16A), mahogany (CG12207), and red Malpighian tubules (CG13646). List of genes was taken from Paaqua Grant et al. 2016.
To create NGS-panel, go to "Extract regions" of GUI.
Fill names of genes into the 1st column of the left table. To add new row, press "Add gene". After that, choose the directory where GenBank-files of the reference genome were saved. Choose reference genome FASTA-file and file for output.
Press "Extract regions". "Starting extraction of genome regions..." should be shown. Wait until "NGS-PrimerPlex finished!" will be shown.
File with extracted regions will be in the chosen directory.
Open it in Microsoft Excel (or other similar programs). To look at it in the Microsoft Excel, open it, go to Data, and Split data onto columns. Change number of multiplexes for each region in 5th column from 1 to necessary number of multiplex reactions. This number can be estimated from the following facts: (1) total number of primer pairs (about 50-70 primer pairs can be joined together); (2) how many overlapping regions (overlapped primer pairs can not be joined); (3) size of amplicons (larger amplicons - less overlappings). Initially, we tried to sort primer pairs to 3 multiplex reactions, so change 1 in the 5th column from 1 to "1,2,3" (without quotes) and save it as a tab-delimited text-file:
Changing default settings
Default settings are for DNA samples from formalin-fixed paraffin-embedded (FFPE) tissue specimens. To study germline mutations, like in this case, we can use amplicons of about 150 bp. So, go to "Settings" and change Minimal amplicon length to 130 bp, optimal and maximal - to 150 bp. Also, you can descrease number of primer pairs designed for 1 locus ("Multiplexing" section) to 5 (then the process will be faster). We can leave other values by default. If you want, you can save your settings to some file and use them in the future:
Running primer design
Go to "Design primers". Values for the file with regions and the reference genome file will be filled automatically from your choice while extracting genome regions. Turn off "Check for covering SNPs", if you don't have VCF-file with known SNPs for the organism or target genes. Fill left and right adapters with one of NGS-PrimerPlex examples or your own sequences (e.g. "ctctctatgggcagtcggtgatt" and "ctgcgtgtctccgactcag").
Press "Start". Initial primer design will take some time (upto 1 hour).
Most likely, it will show the message that for 1-3 regions primers could not be designed with the defined parameters.
Go back to "Settings" and try to increase Maximal primer length from 28 to 32.
Go to "Design primers" again and choose file with draft list of primers. It will let you not to design primers previously designed again. XLS-file with draft list of primers is located in the same directory where your input file is.