User Tools

Site Tools


readme.illumina2pregs

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Next revision
Previous revision
readme.illumina2pregs [2014/05/26 09:48] – created ignacioreadme.illumina2pregs [2024/03/25 18:22] (current) – external edit 127.0.0.1
Line 1: Line 1:
-=====illumina2pregs===== +====== Illumina2preGS ====== 
- - a program to convert genomic information + program to convert genomic information from Illumina files for the BLUPF90 family of programs\\ 
-                 from Illumina files for the BLUPF90 family of programs+
  Ignacio Aguilar - INIA Las Brujas  Ignacio Aguilar - INIA Las Brujas
 +
  email: iaguilar at inia.org.uy  email: iaguilar at inia.org.uy
 +
  02/10/11 - 06/16/12  02/10/11 - 06/16/12
  
 ======Summary====== ======Summary======
  
-Tool for converts FinalReport files from Genome Studio into file suitable for reading with pregsf90 +Tool for converting ''FinalReport'' and ''SNP_Map.txt'' files from Genome Studio into files suitables for reading with ''pregsf90''
-Read only AB codes and generates integer variable 0,1,2  or 5 for missing+
  
-======USAGE====== +Reads only AB codes and generates integer variable 0,1,2  or 5 for missing
-   <file>illumina2pregs --snpfile <yourSnpFinalReportFilename> </file>+
  
-   the only mandatory argument that is required is the --snpfile +======Usage======
-   other options can be modified with optional arguments:+
  
-Optional Arguments +<file>illumina2preGS --snpfile <yourSnpFinalReportFilename> </file>
-  --mapfile      - File name for the map file. Default "SNP_Map.txt" +
-  --genome       - Indicate type of genome (1 - bovine, 2 - ovine) +
-  --nosortmap    - Genotype file will be in the same order as given  +
-  --codeab  <n - Indicate column number for the first allele with code AB (defalut 7)  +
-  --gc  <n thr - Set column number (n) of GC and the cutoff (thr) for exclude call (default from GenomeStudio 0.15)  +
-  --alphasize    - Set the maximun length to store alphnumeric IDs (default 10) +
-  --snpfile_out  - Set a file name for output SNP file (default snps2pregs)+
  
 +The only mandatory argument that is required is the ''--snpfile''
  
-File formats: +Other options can be modified with optional arguments:
-=============+
  
-Input files +=====Optional Arguments===== 
------------ +<file>--mapfile</file> 
-SNP_Map.txt: + File name for the map file. Default "SNP_Map.txt" 
-   1 Index                                              +<file>--genome</file> 
-   2 Name                                               + Indicate type of genome (1 bovine, 2 ovine) 
-   3 Chromosome                                         +<file>--sortmap</file> 
-   4 Position                                           + Genotype file will be sorted by chromosome and position 
-   5 GenTrain                                           +<file>--gc  <thr></file> 
-   6 Score                                              + Set GC cutoff (thr) for exclude call (default from GenomeStudio 0.15)  
-   SNP                                                +<file>--alphasize</file> 
-   8 ILMN                                               + Set the maximun length to store alphanumeric IDs (default 10) 
-   9 Strand                                             +<file>--snpfile_out</file> 
-  10 Customer                                           + Set a file name for the output of the SNP file (default snps2pregs) 
-  11 Strand                                             +<file>--mapfile_out</file> 
-  12 NormID+ Set a file name for the output of the MAP file (default snp_map)
  
-FinalReport.txt: +=====File formats=====
-** this is the default but use --codeab for change it +
-   1 SNP_Name +
-   2 Sample ID +
-   3 Allele1 - Forward +
-   4 Allele2 - Forward +
-   5 Allele1 - Top +
-   6 Allele2 - Top +
-   7 Allele2 - AB +
-   8 Allele1 - AB +
-   9 GC Score +
-  10 X +
-  11 Y+
  
-Output files: +====Input files==== 
--------------+''SNP_Map.txt''  
  
-snps2pregs:+The following fields should be present in the SNP_Map.txt file,    
 + 
 +Any order of columns is supported.   
 + 
 +Column positions will be assumed based on the keywords                                         
 +  * Name                                               
 +  * Chromosome                                         
 +  * Position                                           
 +Other fields will be ignored 
 + 
 + 
 +''FinalReport.txt''   
 + 
 +The following fields should be present in the Final Report file, Any order of columns is supported.   
 + 
 +Column positions will be assumed based on the keywords:   
 +  * SNP_Name   
 +  * Sample ID   
 +  * Allele2 - AB   
 +  * Allele1 - AB   
 + 
 + 
 +Optional fields:   
 +  * GC Score 
 + 
 +Other fields will be ignored. 
 + 
 +====Output files==== 
 + 
 + 
 +''snps2pregs''
     ID (alphnumeric identification of lenght "alphasize", default 10)     ID (alphnumeric identification of lenght "alphasize", default 10)
     one space     one space
     genotypes: 0, 1, 2, for AA, AB and BB respectively and 5 for missing.     genotypes: 0, 1, 2, for AA, AB and BB respectively and 5 for missing.
  
-snp_map: +''snp_map'' 
-   index+   name 
    2 chromosome    2 chromosome
    3 position     3 position 
-   name  +   4 index in original SNP_Map.txt
-   index in original SNP_Map.txt+
    
 +SNP without defined position (CHR = 0) in the SNP_Map.txt will be located in a new CHR 32 in the output snp_map, but still with 0 position. 
readme.illumina2pregs.txt · Last modified: 2024/03/25 18:22 by 127.0.0.1

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki