Kevin Meza Landeros

Update README.md

......@@ -19,7 +19,7 @@ Thats why, our hypothesis is that a predictive model can determine the GCs of th
## Methodology
1. GEO files download
1. __*GEO files download*__
GEO files for all Enterobacteria were downloaded to a server and organized into 4 directories, each containing many _GSE00000000_ folders:
- Binding_exp
- Binding_HT
......@@ -27,12 +27,12 @@ Thats why, our hypothesis is that a predictive model can determine the GCs of th
- Function_HT
Each of the _GSE00000000_ folders contains a compressed file (GSE00000_family.soft.gz) that must be extracted.
2.- Obtaining SOFT files and transforming them to XML format
2. __*Obtaining SOFT files and transforming them to XML format*__
A script goes through every _GSE00000000_ folder and unzips the _"GSE00000_family.soft.gz"_ files to obtain _"GSE00000_family.soft"_ files.
These are all saved in another directory that keeps the structure of the 4 parent directories.
Then another script transforms SOFT files into XML files.
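
The extraction part of step 2 could look roughly like the Python sketch below. It is not the project's actual script: the function name `extract_soft_files`, the two root directories, and the assumption that every archive sits exactly at `<root>/<parent directory>/<GSE folder>/` are illustrative only. The SOFT-to-XML transformation is not shown.

```python
import gzip
import shutil
from pathlib import Path


def extract_soft_files(src_root: Path, dst_root: Path) -> list:
    """Decompress every *_family.soft.gz found under src_root.

    Assumed layout (hypothetical): src_root/<parent dir>/<GSE folder>/<archive>.
    Each decompressed file is written to dst_root/<parent dir>/, so the
    parent directory structure is preserved in the destination.
    """
    written = []
    for archive in sorted(src_root.glob("*/GSE*/*_family.soft.gz")):
        # First path component below src_root is the parent directory name.
        parent_dir = archive.relative_to(src_root).parts[0]
        # Drop the trailing ".gz" to get the name of the SOFT file.
        target = dst_root / parent_dir / archive.name[: -len(".gz")]
        target.parent.mkdir(parents=True, exist_ok=True)
        # Stream the decompressed bytes so large files are not held in memory.
        with gzip.open(archive, "rb") as fin, open(target, "wb") as fout:
            shutil.copyfileobj(fin, fout)
        written.append(target)
    return written
```

Calling `extract_soft_files(Path("geo_download"), Path("soft_files"))` (both paths are placeholders) would return the list of SOFT files it wrote. Streaming with `shutil.copyfileobj` is one reasonable choice here because family SOFT files can be large.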
3. Tagging the GC within the XML files
3. __*Tagging the GC within the XML files*__
## Prerequisites
### Programming languages
......