LSTM for gene fusion classification - #3

Provided data: a list of ~2000 gene fusion sequences in ATCG format (nitrogenous bases)

  1. Create a new dataset by translating the above-mentioned sequences into the protein (amino acid) alphabet.

  2. Create a Long Short-Term Memory (LSTM) classifier that classifies gene fusions as Oncogenic or NotOncogenic. Build two classifiers: one trained on the dataset we provide and one trained on the dataset you built in step 1.

  3. Implement a bidirectional LSTM classifier that classifies gene fusions as Oncogenic or NotOncogenic. Again, build two classifiers: one trained on the dataset we provide and one trained on the dataset you built in step 1.
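
The translation in step 1 could be sketched as below. The assignment does not say which reading frame to use, whether to stop at the first stop codon, or how to treat ambiguous bases, so this sketch makes its own assumptions: the standard genetic code, frame 0 by default, `*` for stop codons, `X` for any codon containing a character other than A, C, G, T, and a trailing partial codon is dropped. The name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
# Assumption: the assignment does not name a translation table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, frame: int = 0) -> str:
    """Translate a nucleotide string into the protein alphabet.

    Hypothetical helper: reads codons starting at offset `frame`,
    keeps stop codons as "*", and does not stop at the first stop.
    """
    seq = dna.upper()[frame:]
    usable = len(seq) - len(seq) % 3  # drop a trailing partial codon
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks codons with non-ACGT characters
        for i in range(0, usable, 3)
    )


protein = translate("ATGGCCTGGTAA")  # codons: ATG GCC TGG TAA
```

Applying `translate` to every provided sequence would give the second dataset; if the fusion sequences are not guaranteed to be in frame, trying frames 0, 1, and 2 is one option to consider.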
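
For steps 2 and 3, one possible shape of the classifier is sketched below. The assignment does not prescribe a framework, so PyTorch is an assumption here, as are all names (`encode`, `FusionLSTM`) and hyperparameters. The same class covers both steps through the `bidirectional` flag, and the same code serves both datasets by changing the alphabet passed to `encode`.

```python
import torch
import torch.nn as nn
from torch.nn.utils.rnn import pack_padded_sequence


def encode(seqs, alphabet):
    """Map characters to integers (0 is reserved for padding) and pad to equal length.

    Hypothetical helper: assumes non-empty sequences whose characters all
    appear in `alphabet`; anything else raises KeyError.
    """
    index = {ch: i + 1 for i, ch in enumerate(alphabet)}
    lengths = torch.tensor([len(s) for s in seqs])
    batch = torch.zeros(len(seqs), int(lengths.max()), dtype=torch.long)
    for row, s in enumerate(seqs):
        batch[row, :len(s)] = torch.tensor([index[ch] for ch in s])
    return batch, lengths


class FusionLSTM(nn.Module):
    """Embedding -> (bi)LSTM -> linear layer producing one logit per sequence."""

    def __init__(self, vocab_size, embed_dim=16, hidden_dim=32, bidirectional=False):
        super().__init__()
        self.embed = nn.Embedding(vocab_size + 1, embed_dim, padding_idx=0)
        self.lstm = nn.LSTM(embed_dim, hidden_dim, batch_first=True,
                            bidirectional=bidirectional)
        directions = 2 if bidirectional else 1
        self.out = nn.Linear(hidden_dim * directions, 1)

    def forward(self, batch, lengths):
        # Packing lets the LSTM skip padded positions.
        packed = pack_padded_sequence(self.embed(batch), lengths.cpu(),
                                      batch_first=True, enforce_sorted=False)
        _, (h_n, _) = self.lstm(packed)
        # h_n has shape (directions, batch, hidden_dim) for a single layer;
        # concatenate the final state of each direction.
        final = torch.cat(tuple(h_n), dim=1)
        return self.out(final).squeeze(1)


# Usage on two toy nucleotide sequences (not real data).
batch, lengths = encode(["ATGGCC", "ATG"], "ACGT")
uni_logits = FusionLSTM(vocab_size=4)(batch, lengths)
bi_logits = FusionLSTM(vocab_size=4, bidirectional=True)(batch, lengths)
```

The model returns raw logits, so a loss such as `nn.BCEWithLogitsLoss` with labels 1 for Oncogenic and 0 for NotOncogenic would be a natural fit; that label encoding is likewise an assumption.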