Skip to content

Finding proxy SNPs in the outcome data when using the "read_outcome_data" function. #407

Description

@nicholaspudjihartono

Please make sure that this is a feature request! If you have questions about how to use TwoSampleMR please use the Discussions function instead.

Let's say I have "SNP1" in my exposure data, but "SNP1" is not present in my local outcome data (i.e., the outcome GWAS was not taken from the "available_outcomes()" function). As the outcome data is local, we cannot use the "extract_outcome_data" function to automatically find proxy SNPs. Instead, you have to use the "read_outcome_data" function, which currently does not have the option to automatically search for proxy SNPs.

Therefore, I wonder if there is a way to find proxy SNPs in the outcome data using the "read_outcome_data" function , or maybe this feature should be implemented inside the "read_outcome_data" function.

This is crucial because some diseases like Juvenile Idiopathic Arthritis, do not have available GWAS with substantial number of samples in the IEU GWAS database (which is where the "available_outcomes()" function get their data). So I prefer to download the summary statistics of a GWAS that I prefer from the GWAS catalog. However doing this means that I cannot use the "extract_outcome_data" function which handles proxy SNP automatically.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions