Larisa Morales Soto

Merge branch 'master' of pakal.ccg.unam.mx:larisams/gene-disease-embeddings

Showing 1 changed file with 17 additions and 0 deletions
1 +# Construction of abstract title embeddings
2 +
3 +The main goal of this project is to obtain sentence embeddings of 450 abstract titles, apply hierarchical clustering to the resulting vectors, and analyze the characteristics of the sentences in each group. We expect sentences that indicate an association between a gene and a disease to end up clustered together.
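
The clustering step described above can be sketched as follows. This is only an illustration under stated assumptions: the six toy vectors stand in for the real title embeddings, and the use of SciPy, average linkage, cosine distance, and a cut into two clusters are all choices made for this example, not details confirmed by the repository.

```python
import numpy as np
from scipy.cluster.hierarchy import fcluster, linkage

# Hypothetical stand-in for the title embeddings: six 4-dimensional vectors
# forming two well-separated groups (not data from this project).
embeddings = np.array([
    [1.0, 0.1, 0.0, 0.0],
    [0.9, 0.0, 0.1, 0.0],
    [1.0, 0.0, 0.0, 0.1],
    [0.0, 1.0, 0.1, 0.0],
    [0.1, 0.9, 0.0, 0.0],
    [0.0, 1.0, 0.0, 0.1],
])

# Agglomerative (hierarchical) clustering. Average linkage and cosine
# distance are assumptions for this sketch.
tree = linkage(embeddings, method="average", metric="cosine")

# Cut the dendrogram into at most two flat clusters (assumed number).
labels = fcluster(tree, t=2, criterion="maxclust")

# Group row indices by cluster label so each group can be inspected.
groups = {}
for index, label in enumerate(labels):
    groups.setdefault(int(label), []).append(index)
print(groups)
```

With real data, `embeddings` would be replaced by the matrix of sentence vectors (one row per title), and the indices in `groups` would map back to the titles for inspection.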
4 +
5 +
6 +### Input data sets
7 +```Shell
8 +\corpora
9 +```
10 +### Sentence embeddings
11 +```Shell
12 +\embeddings
13 +```
14 +### Programs and commands used
15 +```Shell
16 +\scripts
17 +```
\ No newline at end of file