MR-Base
@mrbase2
Followers
661
Following
130
Media
5
Statuses
110
A platform for Mendelian randomisation & PheWAS using results from GWAS. Includes API, web interface and R packages https://t.co/JdnVtX9S0t
Bristol, England
Joined October 2016
Great thread on MR-Base! includes description of my recent updates
We describe the design and scope of @mrbase2 ( https://t.co/vyutdC7nuH): a platform that integrates a database of complete GWAS results (no restrictions according to p-values) with an API, webapp and R packages that automate Mendelian randomization (MR) using GWAS results 2/n
0
2
3
New postdoc opportunity in Health Data Science at the MRC Integrative Epidemiology Unit (@mrcieu) at University of Bristol. Get in touch if interested!
0
26
20
Join me at #ElasticCC where I'll be talking about how we use Elasticsearch for @OpenGwas. This is @Elastic’s free technical event from the community, for the community — happening from Feb 26 - 27 https://t.co/ifxbscFHMW
0
10
18
We've made some major changes to mrbase and the twosamplemr R package. Details here:
0
1
6
Were experiencing problems with the mrbase web app at the moment. You can continue to access the data via https://t.co/YGA8VTCVQf and @OpenGwas while we work on a fix
0
0
5
MR-Base was down due to server outage with our cloud hosting provider. Should be working now!
0
0
1
Analytical tools - The gwasglue R package connects the local or cloud data sources, as well as cloud hosted LD reference panels, to a range of third party tools. Fine mapping, colocalisation, MR, genetic correlations, PheWAS, etc.
1
3
6
Local querying - All the data can also be downloaded as GWAS VCF files. There is a need for a storage format that is unambiguous, self contained and performant for querying. We designed this for large scale analysis on compute clusters
biorxiv.org
Genome-wide association study (GWAS) summary statistics are a fundamental resource for a variety of research applications [1][1]–[6][2]. Yet despite their widespread utility, no common storage format...
1
3
3
Cloud querying - elasticsearch to store the data, hosted in collaboration with Oracle Cloud Infrastructure. We worked hard to optimise queries, allowing rapid search by position, rsid, range, p-value. This can be done in R, python or through a RESTful API.
1
3
3
QC process - we align the non-effect allele to the human genome reference sequence; and annotate the positions with dbSNP identifiers. Example QC report:
2
3
3
Data sources - To date, the summary data have been manually harvested/pooled from various GWAS consortia and biobanks. Many thanks to these studies who have made their summary data available!
1
3
3
OpenGWAS ( https://t.co/M3WhUqb9Gr) is a harmonised database of complete GWAS summary data (126 billion associations, almost 15000 complete datasets) with programmatic connections to a range of third party analytical tools.
1
3
4
Continued data harvesting - for new GWAS results that are published, upload them to the EBI GWAS catalog and we will pull them in from there. For unpublished results, or large batches, we have pipelines to do that. Get in touch here:
github.com
A place to request or contribute data to the IEU GWAS database - MRCIEU/opengwas-requests
1
3
3
Huge thanks to @The_MRC, @OracleCloud and @wellcometrust for funding much of this, and team (@explodecomputer @tomrgaunt @elswob @matt_s_lyon @prime2 @assetweb @JonJHallett and others) for making a hopefully useful community resource
0
3
6
We built the OpenGWAS resource to be a free and open platform to support work on GWAS summary data. Much of it is based on extensive feedback on @mrbase2 from both internal and external colleagues. Paper here: https://t.co/pyXRTuhynU, key points below:
biorxiv.org
Data generated by genome-wide association studies (GWAS) are growing fast with the linkage of biobank samples to health records, and expanding capture of high-dimensional molecular phenotypes....
1
17
19
Paper from Matt Lyon @mrc_ieu describing an adapted variant call format (VCF) for efficient and robust storage of GWAS summary statistics
biorxiv.org
Genome-wide association study (GWAS) summary statistics are a fundamental resource for a variety of research applications [1][1]–[6][2]. Yet despite their widespread utility, no common storage format...
1
7
13
EpiGraphDB v0.3.0 is released today, with ~10x expansion of data to support #triangulation of #epidemiology evidence and #datamining. Check out the latest and greatest features at https://t.co/zPB8zJgbAM!
0
3
1
Guidelines on performing Mendelian randomization investigations written by an all-star line-up of MR researchers are now available on Wellcome Open Research: https://t.co/tdg3JHX58K - represents a consensus statement after 12+ months of deliberation. Comments welcome!
3
70
141
The @mrc_ieu GWAS database is now being hosted on Oracle Cloud. Should see some major performance improvements. Access it now via @mrbase2 and https://t.co/8AOFCRVX54. Big thanks to @OracleCloud and @philrbates
github.com
R package for performing 2-sample MR using IEU OpenGWAS database - MRCIEU/TwoSampleMR
0
10
14
Phenome-wide Mendelian randomization mapping the influence of the plasma proteome on complex diseases https://t.co/tS7lt5cnKX New MR paper suggest proteins with MR and colocalization evidence is more likely to predict drug trial success
biorxiv.org
The human proteome is a major source of therapeutic targets. Recent genetic association analyses of the plasma proteome enable systematic evaluation of the causal consequences of variation in protein...
2
22
61