Skip to content
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Original file line number Diff line number Diff line change
Expand Up @@ -209,6 +209,38 @@ Selecting the right genomic dataset is the foundational step that determines the

In the next article of this series, "From Dataset to Discovery," we will put this framework into practice, demonstrating how to apply these pillars to real-world study designs including GWAS, PRS validation, and rare variant analysis.


::: {.article-btn}
[Explore more Applied Insights](https://https://realworlddatascience.net/applied-insights/)
:::

::: {.further-info}
::: grid

::: {.g-col-12 .g-col-md-12}
About the author:
: [Alieyeh Sarabandi Moghaddam](https://uk.linkedin.com/in/alieyeh-sarabandi-moghaddam) is a Genomic Data Scientist at Dementias Platform UK (DPUK). With a background in computer engineering and an MSc in Health Data Science (Genomics) from the University of Exeter, she conducts research in statistical genetics and multi-omics integration, while also designing reproducible bioinformatics workflows and secure research infrastructure. Her work focuses on helping researchers navigate complex genomic datasets through practical frameworks and governance-aware data provisioning.
: [Fatemeh Torabi](https://uk.linkedin.com/in/fatemeh-torabi-909190b3) is an Assistant Professor in Healthcare Data Science at the University of Cambridge, where her research develops statistical methods for risk prediction and treatment optimisation in long-term conditions. She co-directs the Master of Studies in Genomic Medicine and has led the creation of a global MicroMasters in healthcare data science. Her work focuses on translating methodological innovation into real-world healthcare improvements through secure, equitable data access.
: [Emma Squires](https://uk.linkedin.com/in/emma-squires-977b6717a)is Chief Operating Officer for Dementias Platform UK (DPUK) and Head of Programmes and Innovation for the UK Secure Research Platform (SeRP), specialising in the operational design, governance, and delivery of Trusted Research Environments (TREs) for national health data programmes. She co-chairs the Synthetic Data Working Group and has co-authored AI governance frameworks for TREs, with a focus on building durable systems that maintain public trust and regulatory confidence.
: [Kenneth Langlands](https://uk.linkedin.com/in/kennylanglands) has a BSc in genetics and a PhD in cancer biology from the University of Edinburgh. Following post-doctoral research posts in Bristol, Pittsburgh and Cambridge, he went on to combine a career in bioinformatics with medical education. Dr Langlands returned to Cambridge in September of 2023 to become course director of the Master of Studies in Genomic Medicine at Cambridge.
:::

::: {.g-col-12 .g-col-md-6}
**Copyright and licence** : © 2026 Annie Flynn
<a href="http://creativecommons.org/licenses/by/4.0/?ref=chooser-v1" target="_blank" rel="license noopener noreferrer" style="display:inline-block;">
<img style="height:22px!important;vertical-align:text-bottom;" src="https://mirrors.creativecommons.org/presskit/icons/cc.svg?ref=chooser-v1">
<img style="height:22px!important;margin-left:3px;vertical-align:text-bottom;" src="https://mirrors.creativecommons.org/presskit/icons/by.svg?ref=chooser-v1">
</a>
This article is licensed under a Creative Commons Attribution 4.0 (CC BY 4.0)
<a href="http://creativecommons.org/licenses/by/4.0/?ref=chooser-v1" target="_blank" rel="license noopener noreferrer" style="display:inline-block;">International licence</a>.
:::

::: {.g-col-12 .g-col-md-6}
**How to cite** :
Moghaddam, Alieyeh Sarabandi; Torabi, Fatemeh; Squires, Emma; and Langlands, Kenneth “**Choosing the Right Genomic Dataset: A Five-Pillar Framework for Researchers**” *Real World Data Science*, 2026. [URL](https://realworlddatascience.net/applied-insights/tutorials/posts/2026/05/21/genomic-data-sets-guide.html)
:::


## References
1\) Bahcall O. G. (2021). In this issue: GA4GH standards enable the responsible sharing of human genomic and biomedical data. *Cell Genomics*, *1*(2), 100038. [https://doi.org/10.1016/j.xgen.2021.100038](https://doi.org/10.1016/j.xgen.2021.100038)

Expand Down
Loading