5. CpG_distrb_chrom.py¶
5.1. Description¶
This program calculates the distribution of CpG over chromosomes
5.2. Options¶
--version show program’s version number and exit -h, --help show this help message and exit -i INPUT_FILES, --input_files=INPUT_FILES Input CpG file(s) in BED3+ format. Multiple BED files should be separated by “,” (eg: “-i file_1.bed,file_2.bed,file_3.bed”). BED file can be a regular text file or compressed file (.gz, .bz2). The barplot figures will NOT be generated if you provide more than 12 samples (bed files). [required] -n FILE_NAMES, --names=FILE_NAMES Shorter and meaningful names to label samples. Should be separated by “,” and match CpG BED files in number. If not provided, basenames of CpG BED files will be used to label samples. [optional] -s CHROM_SIZE, --chrom-size=CHROM_SIZE Chromosome size file. Tab or space separated text file with two columns: the first column is chromosome name/ID, the second column is chromosome size. This file will determine: (1) which chromosomes are included in the final bar plots, so do NOT include ‘unplaced’, ‘alternative’ contigs in this file. (2) The order of chromosomes in the final bar plots. [required] -o OUT_FILE, --output=OUT_FILE The prefix of the output file. [required]
5.3. Input files (examples)¶
5.4. Command¶
$ chrom_distribution.py -i 450K_probe.hg19.bed3.gz,850K_probe.hg19.bed3.gz -n 450K,850K \
-s hg19.chrom.sizes -o chromDist
Output files
- chromDist.txt
- chromDist.r
- chromDist.CpG_total.pdf
- chromDist.CpG_percent.pdf
- chromDist.CpG_perMb.pdf
Total CpG count per chromosome
CpG percent on each chromosome (normalized to total CpGs)
CpG per Mb (normalized to chromosome size)