Showing
1 changed file
with
13 additions
and
2 deletions
... | @@ -7,10 +7,21 @@ Méndez Cruz Carlos Francisco | ... | @@ -7,10 +7,21 @@ Méndez Cruz Carlos Francisco |
7 | **Members** | 7 | **Members** |
8 | Gaytan Nuñez Estefani | 8 | Gaytan Nuñez Estefani |
9 | Meza Landeros Kevin Emmanuel | 9 | Meza Landeros Kevin Emmanuel |
10 | -Tierrafría Victor | 10 | +Tierrafría Victor _(curator)_ |
11 | + | ||
12 | +## Main Purpose | ||
13 | +As it is known, GEO Database is home to thousands of High-Throughput (HT) genetic expresison experiments. The documentation for each experiment done within the database includes the Growth Conditions (GC) used in it. Unfurtunately they are not registered in a structured way, but they are into text fragments associated with various fields (we call them metadata). | ||
14 | +Since knowing the GCs of these experiments helps to better understand genetic regulation, it becomes important to extract these conditions. However, doing it manually requires a lot of | ||
15 | +effort on large data sets. | ||
16 | + | ||
17 | +Thats why, our hypothesis is that a predictive model can determine the GCs of thousands of experiments stored in the GEO. Our goal is to generate a report, that will be used by curators to review and validate the GC of the experiments. | ||
18 | + | ||
19 | + | ||
11 | 20 | ||
12 | ## Metodolgy | 21 | ## Metodolgy |
13 | - 1. Files were | 22 | + 1. GEO files download |
23 | + 2. Obtaining SOFT files and its transformation to an XML format | ||
24 | + 3. Tagging the GC within the XML files | ||
14 | 25 | ||
15 | ## Prerequisites | 26 | ## Prerequisites |
16 | ### Programming languages | 27 | ### Programming languages | ... | ... |
-
Please register or login to post a comment