-
Notifications
You must be signed in to change notification settings - Fork 41
How to repeat file Xreducedall.2002.npy for another organisms #23
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
For generating the equivalent of Xreducedall.2002.npy for new organisms, you will need to first train a sequence model to predict chromatin profiles in the organism of interest for you first, or use an existing model such as DeepArk (https://www.ncbi.nlm.nih.gov/pubmed/33888512/). It is important that Xreducedall.2002.npy is not computed from NarrowPeak files (as you mentioned in your other post), but is computed from sequence model predictions from sequences centered at major TSS for genes. Once you have the sequence model predictions, codein this discussion should be helpful for generating the equivalent of Xreducedall.2002.npy #9 |
Hi, |
If you want to train new sequence models for epigenetics data, feel free to check out https://github.com/FunctionLab/selene (there are tutorials and manuscript examples provided). Note that for ExPecto model there are two steps of training that are needed. First, you train the chromatin profiles sequence model, which will allow you to generate the equivalent Xreducedall.2002.npy, then you can modify the ExPecto script to do the second step of training for expression prediction. |
Hi, |
Hi,
Could you provide the python script to produce Xreducedall.2002.npy file, pls?
Maybe we can use the pipeline for another organisms analysis, such as FLY. Thansk in advance!!!
Best Wishes,
The text was updated successfully, but these errors were encountered: