System Grab Bag

View all TLDR pages from common (or from all pages)

bedtools

A swiss-army knife of tools for genomic-analysis tasks. Used to intersect, group, convert and count data in BAM, BED, GFF/GTF, VCF format. More information: https://bedtools.readthedocs.io.
  • Intersect two files regarding the sequences' strand and save the result to the specified file:
    bedtools intersect -a {{path/to/file_1}} -b {{path/to/file_2}} -s > {{path/to/output_file}}
  • Intersect two files with a left outer join, i.e. report each feature from {{file_1}} and NULL if no overlap with {{file_2}}:
    bedtools intersect -a {{path/to/file_1}} -b {{path/to/file_2}} -lof > {{path/to/output_file}}
  • Using more efficient algorithm to intersect two pre-sorted files:
    bedtools intersect -a {{path/to/file_1}} -b {{path/to/file_2}} -sorted > {{path/to/output_file}}
  • Group file {{path/to/file}} based on the first three and the fifth column and summarize the sixth column by summing it up:
    bedtools groupby -i {{path/to/file}} -c 1-3,5 -g 6 -o sum
  • Convert bam-formatted file to a bed-formatted one:
    bedtools bamtobed -i {{path/to/file}}.bam > {{path/to/file}}.bed
  • Find for all features in {{file_1}}.bed the closest one in {{file_2}}.bed and write their distance in an extra column (input files must be sorted):
    bedtools closest -a {{path/to/file_1}}.bed -b {{path/to/file_2}}.bed -d

License and Disclaimer

The content on this page is copyright © 2014—present the tldr-pages team and contributors.
This page is used with permission under Creative Commons Attribution 4.0 International License.

While we do attempt to make sure content is accurate, there isn't a warranty of any kind.