how_to

Molecular data available in GDC

2020-07-30
The NCI's Genomic Data Commons (GDC) provides the cancer research community with a unified data repository that enables data sharing across cancer genomic studies in support of precision medicine. The GDC supports several cancer genome programs at the NCI Center for Cancer Genomics (CCG), including … Read more →

Querying BigQuery

2020-07-30
Find examples here: https://hakyimlab.github.io/bigquery-covid19/query-phenomexcan.html […] workflowr code: https://github.com/hakyimlab/bigquery-covid19 Read more →

Hands-On Training: plink

2020-07-14
LATEST VERSION IN https://bios25328.hakyimlab.org/post/2021/04/07/plink-tutorial/ […] A more detailed tutorial for GWAS analysis is here. […] We will learn to run a GWAS using plink. This tutorial follows a plink tutorial you can find here […] … Read more →

Using R_Markdown

2020-07-08 Ian Waters
R Markdown is an authoring framework that allows for reproducible documentation of data science within the context of R Studio. This is an introduction designed to teach you how to use R Markdown in a few minutes. Further info and some of the examples used below can be found here: … Read more →

Hands-On Training: R

2020-07-05 Laura Vairus
Swirl is a great and easy way to get you started with R. Install and open it by clicking the green arrow on the right. Install Swirl Package: install.packages("swirl") library("swirl") swirl() Once you enter your username, it prompts you to choose a course to install. Choose 1 … Read more →

Hands-On Training: Command Line

2020-07-02 Laura Vairus
In this tutorial, we will learn some basic Unix/Linux commands to perform tasks in the command line. The command line is an interface that allows you to store, manage, and process data. Terminal is an app that gives you access to the command line. (Note: This tutorial was made for Mac users. For … Read more →

Converting Fusion weight to PredictDB format

2020-06-22 Sabrina Mi
If you want to use the MetaXcan suite of tools, you will need to format your prediction weights in SQLite format. […] Find a sample script that formats FUSION weights to PredictDB format here […] If you want to use the summary version of PrediXcan, you will also need the covariances … Read more →

IntroStatGen R Studio Servers using Google Cloud

2020-06-16 Owen Melia
For the one-day seminar, we had a hands-on lab where we decided we needed to set up R Studio Servers. The servers needed pre-loaded data, access to a terminal, pre-compiled binaries for torus and fastenloc, and the correct python/R/Linux environments to run all of our analyses. Here’s a guide … Read more →

dbGaP Project Renewal

2001-07-30
The email reminder to renew dbGaP projects should link to the My Projects tab after logging in, where you can select the project to request renewal. For the most part, the application will walk through the steps if you click ‘save and continue’ at the bottom of each page, but there are tabs … Read more →

GWAS on ANL Servers

2001-07-30
Steps for running the GWAS on ANL’s servers. […] This was done by Tom Brettin. Hail is working on the nucleus machine, but not on washington at the moment. […] Downstream analysis can be a lot faster with a smaller genetic dataset. We filtered for individuals in the brain imaging … Read more →