In this tutorial, I would like to share how one can import SSR (simple sequence repeats) marker data on to TASSEL software, and perfrom analyses such as imputation, General Linear Model and Principle Component Analysis.

1.0 Installing TASSEL software

Download and install the latest version of the TASSEL software at this link: https://www.maizegenetics.net/tassel

Ignore this step if you already have installed TASSEL.


1.1 Input data

The input SSR marker data must formated as instructed below:

  • Add "Numeric” in the first line of the data
  • The missing data (-1) was denoted as “NA”
  • TASSEL only allows numeric marker data ranging from 0 to 1
  • Save your numeric data in a text file format (.txt) prior to loading in tassel


Note: If you have two alleles per SSR marker then you can load it on TASSEL as a numeric data as 0 for major and 1 for minor alleles. However, If SSRs alleles have more than 2 alleles per marker I suggest converting as below:

  • Most common = A
  • 2nd most common = C
  • 3rd most common = G
  • 4th most common = T
  • 5th most common = +
  • All other alleles = - (dash)


See an example of the input file: Input data


1.2 Importing phenotype and genotype files

Import the files by following the steps shown below.

Tip! Both files can be opened at same time holding CTRL and clicking the file names.

load

1.3 Imputation

This is an optional step, however, in order to conduct PCA analysis you need to impute and should not have any missing data.

Follow the below steps to perform numerical imputation:

impute

1.4 PCA analysis

Follow the below steps to perform PCA analysis:

pca

1.5 perform GLM

Run the GLM analysis by selecting the intersected files following the steps below:

GLM

2.0 Convert Genotype calls into Numeric format in TASSEL

To convert your genotype data (A,C,G,T) into numeric format (0, 0.5, 1) follow the below steps:

Numeric Format


--- End of Tutorial ---

Thank you for reading this tutorial. If you have any questions or comments, please let me know in the comment section below or send me an email.

Bibliography

Bradbury PJ, Zhang Z, Kroon DE, Casstevens TM, Ramdoss Y, Buckler ES. (2007) TASSEL: Software for association mapping of complex traits in diverse samples. Bioinformatics 23:2633-2635.