Replies: 1 comment 3 replies
-
The questions you enumerate under point 3 are the main reason that there is no publicly available large-scale data of this kind. Some projects collect tens of thousands of genomes, but all that data is under restricted access. The first step would be to join a group that is allowed to access that data. |
Beta Was this translation helpful? Give feedback.
3 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hello everyone,
I am a final-year CS student interested in bioinformatics. I am exploring an idea for predicting disability/disease risks in offspring before conception by analyzing parental genome patterns and junk DNA (non-coding regions) using AI/ML.
What I want to know:
1️⃣ Is it currently feasible to use non-coding/junk DNA mutation patterns, combined with machine learning, to predict the risk of diseases or disabilities pre-conception?
2️⃣ What datasets, resources, or pipelines would you recommend to start learning and experimenting in this direction practically?
3️⃣ Are there ethical or privacy considerations I should deeply understand before pursuing this line of research?
I am looking for practical advice to start small, recommended tools, papers, or courses, and insights on the current limitations in using junk DNA for disease prediction.
This is a dream research direction for me, and I want to build my skills responsibly while exploring this impactful idea.
Thank you for your time and any guidance you can share!
Warm regards,
Gayathri
Beta Was this translation helpful? Give feedback.
All reactions