Yuanlong LIU 2 years ago
parent
commit
a6bb44eb68
2 changed files with 1 addition and 1 deletion
  1. .DS_Store (BIN)
  2. README.md (+1 -1)

BIN
.DS_Store


+ 1 - 1
README.md

@@ -15,7 +15,7 @@ CALDER is a Hi-C analysis tool that allows: (1) compute chromatin domains from w
 
 ## Introduction of optimized `bin_size` selection
 
-Due to factors such as low data quality or large-scale structural variation, compartments can be called unreliably at one `bin_size` but captured at another. We added an optimized `bin_size` (equivalent to `resolution` in the literature) selection strategy for calling reliable compartments at high resolution. This is based on the observation from our large-scale compartment analysis that, although compartments can change between conditions, their overall consistency (measured by `cor(compartment_rank_1, compartment_rank_2)`) is high (> 0.4).
+Due to factors such as low data quality or large-scale structural variation, compartments can be called unreliably at one `bin_size` but captured at another. We added an optimized `bin_size` (equivalent to `resolution` in the literature) selection strategy for calling reliable compartments. This is based on the observation from our large-scale compartment analysis that, although compartments can change between conditions, their overall consistency (measured by `cor(compartment_rank_1, compartment_rank_2)`) is high (> 0.4).
 <br>
 <br>
 Using the consistency measure described above, we choose the smallest `bin_size` such that no larger `bin_size` increases the consistency by more than 0.05. For example, if the consistency is 0.2 at `bin_size=10000` and 0.6 at `bin_size=50000`, the latter is considered more reliable; if it is 0.5 at `bin_size=10000` and 0.52 at `bin_size=50000`, we choose the former because it has the higher resolution.
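
The selection rule in the paragraph above can be sketched as follows. This is a minimal illustration in Python, not CALDER's actual implementation: the function name `select_bin_size`, the `tolerance` parameter, and the dictionary input are assumptions made for the example, and the consistency values are taken as already computed.

```python
def select_bin_size(consistency, tolerance=0.05):
    """Pick the smallest bin_size that no larger bin_size beats by more than `tolerance`.

    `consistency` maps each candidate bin_size (in bp) to its consistency score.
    Hypothetical helper for illustration; not part of CALDER.
    """
    if not consistency:
        raise ValueError("at least one bin_size is required")

    sizes = sorted(consistency)
    for i, size in enumerate(sizes):
        larger_sizes = sizes[i + 1:]
        # Accept this bin_size if every larger one improves the
        # consistency by at most `tolerance`.
        if all(consistency[b] - consistency[size] <= tolerance for b in larger_sizes):
            return size
    # Not reached: the largest bin_size always passes the check above.
    return sizes[-1]


# The two cases described in the text:
print(select_bin_size({10000: 0.2, 50000: 0.6}))   # 50000
print(select_bin_size({10000: 0.5, 50000: 0.52}))  # 10000
```

Scanning from the smallest `bin_size` upward means the first candidate that passes is also the highest resolution that satisfies the rule.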