DDR scores were imported from a perviously cleaned version of the data from Knijnenburg et al. DDR scores were available for 6440 samples.
Ancestry estimates were imported from another github repo. Ancestry estimates from the same tissue type were merged with DDR scores. The final dataset contains 6440 samples.
DDR scores by dominant ancestral populations and stratified by cancer type.
Linear model for DDR score by dominant population (ref = “EUR”) and adjusting for cancer type.
| term | estimate | std.error | statistic | p.value |
|---|---|---|---|---|
| AFR | 0.7180697 | 0.1291673 | 5.5592217 | 0.0000000 |
| AMR | -0.1106005 | 0.2275965 | -0.4859497 | 0.6270195 |
| EAS | 0.7397770 | 0.1536566 | 4.8144816 | 0.0000015 |
| SAS | 0.9912771 | 0.4144066 | 2.3920398 | 0.0167837 |
The previous figure and model were run using the dominant population from blood derived normal samples, rather than the sample that DDR was based off. Only about 10% of samples vary in dominant population between sample types so changes are minimal.
DDR scores by dominant ancestral populations (in normal) and stratified by cancer type.
| term | estimate | std.error | statistic | p.value |
|---|---|---|---|---|
| AFR | 0.6521750 | 0.1380004 | 4.7258927 | 0.0000023 |
| AMR | 0.0708100 | 0.2639148 | 0.2683065 | 0.7884737 |
| EAS | 0.7827773 | 0.1666662 | 4.6966757 | 0.0000027 |
| SAS | 0.9681139 | 0.4343137 | 2.2290662 | 0.0258506 |