The number of genes and transcripts we have in Ensembl can make your VEP results very big. Filtering your results after running the VEP is the best way to make this more manageable, but you can also reduce the results in your run itself, to only get one result per variant or variant/gene combo.
You can restrict your results in either the offline or the online tools. The options on the offline tool are shown in the documentation, whereas the online options are in the Filtering options section of the input form.
When running offline, you can also take advantage of the ‘flag’ options to mark the lines which meet your selection criteria while printing the full output. This makes it trivial to extract the subset of results you are most interested in, but provides full results in case you need to dig deeper.
How the rules work is easiest to understand with an example variant. In this case our variant has two possible alternative alleles, hits three transcripts of Gene1, falling in a coding exon of two of them, and one transcript of the non-coding Gene2. The A allele is synonymous, whereas the C allele causes a missense change, altering the amino acid.
The full set of options are summarised here:
Offline option | Online option | Rule | Example |
–most_severe | Show most severe consequence per variant | Uses the severity ranking of the variant consequences to choose the most severe. This erases information about the individual transcripts. | C, missense, Gene1 |
–summary | Show only a list of consequences per variant | Instead of giving you one line for each consequence, each variant gets one line in the table, with all the consequences shown. | All the consequences |
–pick | Show one selected consequence per variant | Ranks all the transcripts (including those of different genes) and consequences by a set of criteria, and displays only the consequence on the top-ranking transcript. Where two are equally ranked, the choice is arbitrary | C, missense, Gene1-1 |
–pick_allele | Show one selected consequence per variant allele | As –pick, but if there are multiple alternative alleles, shows one consequence for each | C, missense, Gene1-1
A, synonymous, Gene1-1 |
–per_gene | Show one selected consequence per gene | As –pick, but does the analysis separately for each gene, so you have one consequence per gene. | C, missense, Gene1-1
C, non_coding_transcript_variant, Gene2-1 (the A allele could just as easily have been selected as they rank the same) |
–pick_allele_ gene | A combination of –pick_allele and –per_gene. | C, missense, Gene1-1
A, synonymous, Gene1-1 C, non_coding_transcript_variant, Gene2-1 A, non_coding_transcript_variant, Gene2-1 |
|
–flag_pick | Displays all the consequences, but adds a flag to the consequence chosen by –pick. | ||
–flag_pick_ allele | Displays all the consequences, but adds flags to the consequences chosen by –pick_allele. | ||
–flag_pick_ allele_gene | Displays all the consequences, but adds flags to the consequences chosen by –pick_allele_gene. | ||
–pick_order | Use with any of the –pick options and change the order of the criteria. |