@@ -15,7 +15,7 @@ CALDER is a Hi-C analysis tool that allows: (1) compute chromatin domains from w
## Introduction of optimized `bin_size` selection
-We added an optimized `bin_size` (equivalent to `resolution` in the literature) selection strategy for the purpose of calling reliable compartments at high resolution. This is based on the observation from our large-scale compartment analysis that, although compartments can change between different conditions, their overall consistency (measured by `cor(compartment_rank_1, compartment_rank_2)`) is high (> 0.4). Due to reasons such as low data quality or large-scale structural variation, compartments can be unreliably called at one `bin_size` but can be captured at another `bin_size`.
+Due to reasons such as low data quality or large-scale structural variation, compartments can be unreliably called at one `bin_size` but captured at another `bin_size`. We added an optimized `bin_size` (equivalent to `resolution` in the literature) selection strategy for calling reliable compartments at high resolution. This is based on the observation from our large-scale compartment analysis that, although compartments can change between different conditions, their overall consistency (measured by `cor(compartment_rank_1, compartment_rank_2)`) is high (> 0.4).
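
The consistency measure `cor(compartment_rank_1, compartment_rank_2)` can be illustrated with a minimal sketch. It assumes a plain Pearson correlation (the default of R's `cor`) over two equal-length vectors of per-bin compartment ranks. The function name `pearson` and the example ranks are hypothetical and are not taken from the CALDER code base.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical compartment ranks of the same six bins in two conditions.
compartment_rank_1 = [1, 2, 3, 4, 5, 6]
compartment_rank_2 = [1, 3, 2, 4, 6, 5]

consistency = pearson(compartment_rank_1, compartment_rank_2)
print(round(consistency, 3))  # 0.886, above the 0.4 level mentioned in the text
```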
<br>
<br>
We define the consistency as the correlation `cor(compartment_rank_1, compartment_rank_2)`, and choose the smallest `bin_size` such that no larger `bin_size` increases the consistency by more than 0.05. For example, if the consistency is 0.2 for `bin_size=10000` and 0.6 for `bin_size=50000`, we consider the latter more reliable; if the consistency is 0.5 for `bin_size=10000` and 0.52 for `bin_size=50000`, we choose the former because it has higher resolution.
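
The selection rule can be sketched as follows. This is an illustrative Python sketch, not CALDER's actual implementation. The function `select_bin_size`, its `tolerance` parameter, and the dictionary input are assumptions made for the example, and the consistency values are the ones quoted in the text.

```python
def select_bin_size(consistency_by_bin_size, tolerance=0.05):
    """Return the smallest bin_size that no larger bin_size beats by more than `tolerance`.

    `consistency_by_bin_size` maps each candidate bin_size to its consistency score.
    """
    if not consistency_by_bin_size:
        raise ValueError("at least one candidate bin_size is required")
    sizes = sorted(consistency_by_bin_size)
    for i, size in enumerate(sizes):
        current = consistency_by_bin_size[size]
        larger_scores = [consistency_by_bin_size[s] for s in sizes[i + 1:]]
        # Accept this bin_size if no larger one improves consistency by more than the tolerance.
        if all(score - current <= tolerance for score in larger_scores):
            return size
    # Not reached: the largest bin_size has no larger competitor and always qualifies.
    return sizes[-1]


# The two examples from the text.
print(select_bin_size({10000: 0.2, 50000: 0.6}))   # 50000
print(select_bin_size({10000: 0.5, 50000: 0.52}))  # 10000
```

Scanning from the smallest candidate upward means the finest resolution is returned whenever coarser bins offer no meaningful gain in consistency.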