Correlation between genetic, geographic and linguistic distances of Balto-Slavic populations
A Mantel test was applied to compare the roles which geography and language have played in shaping the genetic variation of the Balto-Slavic populations (
Fig 5, Tables I,J in
S1 File). The test was performed independently for the three genetic systems, with all three exhibiting a very high correlation with geography (0.80–0.95) and slightly lower (0.74–0.78) correlation with linguistics (Table J in
S1 File). Because the linguistic pattern itself is highly correlated with geography (
Fig 5), partial correlations were considered to distinguish between the direct and indirect influences of geography on the two other systems. The correlations with linguistics became much lower whilst all three genetic systems maintained high correlations with geography (Table J in
S1 File).
Fig 5
Correlations between matrices of genetic, geographic and linguistic distances among Balto-Slavic populations.
Discussion
Two major genetic substrata are embedded in the gene pools of Slavs
The results of our study have shown the close genetic proximity of the majority of West and East Slavic populations inhabiting the geographic area from Poland in the west, to the Volga River in the East (
Fig 2A and 2B, Tables A,B in
S1 File). Some mtDNA haplotypes of hgs H5, H6, U4a were more frequent in the genomes of West and East Slavic speakers, providing thereby further evidence for the matrilineal unity of West and East Slavs [
28,
36] as well as continuity of mtDNA diversity in the territory of modern Poland for at least two millennia [
38].
In contrast to this apparent genetic homogeneity of the majority of West and East Slavs, the gene pool of South Slavs, who are confined to the geographically smaller Balkan Peninsula, differs substantially and shows internal differentiation, as testified by their NRY and autosomal variation (
Fig 2A and 2B;
Fig 3, Tables A,B in
S1 File). Consequently, we suggest that there is a “central-east European” genetic substratum in West and East Slavs, exemplified by NRY hgs R1a and the k3 ancestry component, and a “south-east European” one, featuring NRY hgs I2a and E plus the k2 ancestry component for South Slavs (
Fig 2A and 2B,
Fig 3, Table K in
S1 File; Tables A,B in
S1 File). Notably, the “south-east European” component does not extend to the whole Balkan Peninsula, as South Slavs are differentiated from Greek sub-populations except Macedonian Greeks (
Fig 2A,
Fig 4B) [
55].
The importance of these substrata in shaping the genetic diversity of the present-day Slavs is evident from the observed lower IBD relatedness between the combined group of East-West Slavs and South Slavs than with north-east Europeans, including Baltic speakers (
Fig 4A). The latter reside within the East European Plain and presumably represent the “central-east European” pre-Slavic substratum (
Fig 4A, Table G in
S1 File). AMOVA results also support the substrata prevalence, because genetic variation among Slavic branches (which assimilated different substratum populations) strongly exceeds intra-branch variation (Table H in
S1 File). The influence of geography in shaping the Slavic genetic heritage (
Fig 5, Table J in
S1 File) led to the same conclusion, because if substratum importance is the major factor shaping the genetic relationships among present-day Slavic-speaking populations, these will not reflect the relationship among expanding Slavic languages, but should instead reflect the relationships between pre-Slavic populations, which can be approximated by geographical distances between them.
Demographic mechanisms shaping the gene pool of Slavic speakers
Most West and East Slavs of Central-East Europe form genetically a compact group of populations that, as a general rule, differ from their western (Germanic-speaking) and eastern (Finno-Ugric-speaking) neighbors (
Fig 2A and 2B;
Fig 4A and 4B). However, so-called ‘contact’ zones of this group with non-Slavic peoples are characterized by various patterns of genetic clines or sharp genetic borders [
27,
32,
56–
58]. For example, there is a pronounced genetic proximity between Czechs and their immediate Germanic neighbors in the west (
Fig 2A and 2B,
Fig 3) [
27,
58] that could be attributed to the pre-Slavic gene pool formation of Central-East Europeans. In contrast, a clear genetic border exists nowadays between Poles and their immediate western neighbors Germans, and even between a West-Slavic-speaking minority–Sorbs–and their German host population (
Fig 2B, Tables A,B in
S1 File) [
43,
59]. It has been suggested, that this genetic boundary predates massive resettlements of people after World War II, and could have been shaped during medieval migrations of Germanic and Slavic peoples in the Vistula and Oder River basins [
60]. In the north-east, a largely autochthonous (pre-Slavic) component is detected in the gene pool of Russians from northern regions of the European part of Russia (
Fig 2A, 2B and 2C,
Fig 3), which agrees with previous anthropological [
61,
62] and genetic [
32,
45,
56,
63] studies and supports substantial admixture of expanding Slavs with indigenous populations and, perhaps, language shift in the latter.
Taken together, several mechanisms including cultural assimilation of the autochthonous populations by expanding Slavs while maintaining the pre-Slavic genetic boundaries, and
in situ gene pool shaping, are needed to explain the genetic patterns observed on the eastern, north-eastern and western margins of the current ‘Slavic area’ within Central-East Europe.
The presence of two distinct genetic substrata in the genomes of East-West and South Slavs would imply cultural assimilation of indigenous populations by bearers of Slavic languages as a major mechanism of the spread of Slavic languages to the Balkan Peninsula. Yet, it is worthwhile to add here evidence from the analysis of IBD segments: the majority of Slavs from Central-East Europe (West and East) share as many IBD segments with the South Slavs in the Balkan Peninsula as they share with non-Slavic populations residing nowadays between Slavs (
Fig 4A and 4B; Table G in
S1 File). This even mode of IBD sharing might suggest shared ancestry/gene flow across the wide area and physical boundaries such as the Carpathian Mountains, including the present-day Finno-Ugric-speaking Hungarians, Romance-speaking Romanians and Turkic-speaking Gagauz. A slight peak at 2–3 cM in the distribution of shared IBD segments between East-West and South Slavs (
Fig 4A and 4B) might hint at shared “Slavonic-time” ancestry, but this question requires further investigation.
Expansion of Slavic languages took place in an area already occupied by speakers of the Baltic languages [
49,
50]. Despite significant linguistic divergence between extant East Baltic and Slavic languages (
Fig 1) [
7], Baltic populations are genetically the closest to East Slavs (
Fig 2A and 2B, Table K in
S1 File) [
45,
64–
66] and here we found that they bear the highest number of shared IBD segments with the combined group of East-West Slavs (
Fig 4, Table G in
S1 File). The presence of a substantial “Baltic substratum” in the genomes of extant Slavs within East Europe might in part explain their genetic closeness to each other and difference from some neighboring non-Slavic groups.
A synthesis
Comparing genetic and linguistic reconstructions with geography has a long tradition in human population genetics [
67]. Here, we have studied the autosomal, NRY and mtDNA diversity of all Balto-Slavic populations in the context of their linguistic variation and geography. A remarkable agreement between these five systems was found: correlation coefficients range from 0.68 to near the maximum (0.95). This agreement between datasets from different systems supports the reliability of the results and in most cases, when drawing a conclusion, we could find one supported by the majority of the systems analyzed. In particular, we found that autosomal and NRY compositions and geographic affiliations of the Balto-Slavic populations form a triad, all variables of which are very similar to each other.
Combining all lines of evidence, we suggest that the major part of the within-Balto-Slavic genetic variation can be primarily attributed to the assimilation of the pre-existing regional genetic components, which differed for West, East and South Slavic-speaking peoples as we know them today.
Go to:
Materials and Methods
Ethics Statement
The DNA samples analysed in the study were collected after having obtained written informed consent. The procedure has been approved by Ethics Committees of the appropriate Institutions, including the Research Ethics Committee of the University of Tartu (UT REC) (no 225/T-9) and the Ethics Committee of the Research Centre for Medical Genetics, Russian Academy of Sciences.
Datasets
Three datasets NRY, mtDNA and autosomal SNP representing populations speaking Balto-Slavic languages were assembled.
The NRY data comprises 6,079 samples, including 1,254 reported here for the first time and 1,138 samples updated from previous work (Table L in
S1 File).
The mtDNA data include 6,876 samples, 917 are reported here for the first time (Table C in
S1 File).
The autosomal SNP data include 1,297 worldwide individuals including 70 reported here for the first time (Table M in
S1 File); this dataset encompasses in total 296 samples representing Balto-Slavic populations.
S1 Text: Datasets provides extended information on dataset assemblage. All samples reported here for the first time were collected after informed consent was obtained from each participant.
Genotyping
40 binary NRY markers were genotyped using the TaqMan (Applied Biosystems) technology as described [
68]. MtDNA analyses included HVS1 sequencing and genotyping of coding region SNPs defining mtDNA hgs [
69] (mtDNA tree Build 15 (30 Sep 2012). The autosomal SNP genotypes were generated with the Illumina 660K array and combined with published data (Table M in
S1 File).
S1 Text: Methods provides details about the autosomal SNP pre-processing performed before all analyses.
MDS, PCA and ADMIXTURE
MDS analysis based on genetic distances [
70] was performed for the NRY and mtDNA datasets (Tables C, K, N in
S1 File). PCA was performed for the autosomal dataset using the
smartpca program of the EIGENSOFT package [
71]; sets of Illumina-Affymetrix cross-platform SNPs (around 57k of LD-pruned SNPs), encompassing available Balto-Slavic populations, were used. Genomic ancestry components in Balto-Slavic speakers in the context of worldwide populations were inferred with ADMIXTURE [
51]; sets of only Illumina cross-platform SNPs (around 200k shared LD-pruned SNPs between the 610K, 650K and 660K arrays) were used (Table M in
S1 File). See
S1 Text: Methods for choosing the value of K which best models the ancestry components in our dataset.
Analysis of pairwise segments IBD
We aimed to compare the level of IBD relatedness between the combined group of East-West Slavs (group1)
vs South Slavs (group2) (i.e. IBD relatedness within Slavs) to the IBD relatedness between each group of Slavs
vs their respective neighboring groups of mostly non-Slavic populaitons (Table F in
S1 File lists populations in each group,
S3 Fig shows schematically the geographic location of each population groups). To this end we: a) calculated an average number of IBD segments per pair of individuals (ibd-statistic) between the group of East-West Slavs (group1) and South Slavs (group2), i.e. within-Slavic IBD sharing, and between each Slavic group and their respective geographic neighbors; b) compared the within-Slavs ibd-statistic with the ibd-statistics for each Slavic group and groups 3–9. The fast IBD (
fIBD) algorithm [
53] implemented in BEAGLE (
http://faculty.washington.edu/browning/beagle/beagle.html) was used to detect pairwise IBD segments. Sets of Illumina-only cross-platform SNPs (around 500k shared SNPs between the 610K, 650K and 660K arrays) were used in the analysis. See
S1 Text: Methods for detailed information about the experimental design and statistical approach applied.
AMOVA and Mantel tests
AMOVA (implemented in the Arlequin 3.11) was applied to estimate genetic differentiation when Balto-Slavic populations were grouped according to the three hierarchical levels of the tree of Balto-Slavic languages (
S1 Text: Methods, Table H in
S1 File,
S5 Fig). Mantel tests were performed in Arlequin 3.11 [
72] to calculate the coefficients of the pairwise and partial correlations between matrices of genetic (mtDNA, NRY and whole genome SNP), linguistic and geographic distances (Table I in
S1 File).
S1 Text: Methods provides additional details for Mantel tests analysis.
Lexicostatistical reconstruction of Balto-Slavic languages
20 wordlists of extant Balto-Slavic languages were used to reconstruct their phylogeny. The consensus tree (
Fig 1, Fig G in
S2 File) was drawn manually based on the set of trees produced by different phylogenetic methods. The method implying individual relative index of stability for each Swadesh item [
73,
74] was used for the node dating.
S2 File, Figs A-C in
S2 File, and Tables A,B in
S3 File contain detailed information about lexicostatistical reconstruction of the Balto-Slavic languages.
Supporting Information
S1 Dataset
(zip-archive).
- bslav.dbf, bslav.var, bslav.inf, lexical dataset in STARLING format (multistate matrix with synonyms allowed). This dataset exported in MS EXCEL format is available as Table A in S3 File.
- bslav.nex, the same dataset as a binary matrix in NEXUS format.
- *.tre, some of the discussed trees in NEWICK format;
- NEXUS files for NeighborNet networks.
(ZIP)
Click here for additional data file.(270K, zip)
S1 Fig
PC1vsPC2 plot based on whole genome SNP data (PC1 = 0.53; PC2 = 0.34).
(PDF)
Click here for additional data file.(56K, pdf)
S2 Fig
ADMIXTURE plot (k2-k20) (A). Box and whiskers plot of the cross validation (CV) indexes of all runs of the ADMIXTURE analysis (B). Log-likelihood (LL) scores of all runs (C). Variation in LL scores in the fractions (5%, 10%, 20% shown in dark green, middle green and light green, respectively) of runs that reached the highest LLs) (D).
(PDF)
Click here for additional data file.(2.3M, pdf)
S3 Fig
Schematic representation of groups of populations used in the IBD analysis.
Populations within each group are listed in Table F in
S1 File. Source of the Europe contour map:
http://www.conceptdraw.com/How-To-Guide/geo-map-europe.
(PDF)
Click here for additional data file.(116K, pdf)
S4 Fig
Distribution of the average number of IBD segments between group of East-West Slavs and their geographic neighbors.
Russians from Northern region of European part of Russia are considered separately from the group of north-east Europeans. The x-axis indicates ten classes of IBD segment length (in cM); the y-axis indicates the average number of shared IBD segments per pair of individuals within each length class.
(PDF)
Click here for additional data file.(34K, pdf)
S5 Fig
Hierarchical levels of genetic variation used in AMOVA.
(PDF)
Click here for additional data file.(250K, pdf)
S1 File
Table A in S1 File. Matrix of pairwise Nei distances (DNei) between Balto-Slavic populations based on Y-chromosome data. Table B in S1 File. Matrix of mean population pairwise FST for Balto-Slavic populations calculated from autosomal SNP data. Table C in S1 File. Frequencies of the mtDNA haplogroups in Balto-Slavic and some other European populations. Table D in S1 File. Matrix of pairwise Nei distances (DNei) between Balto-Slavic populations based on mtDNA data. Table E in S1 File. Predicting the country affiliation for 53 Balto-Slavic populations from their Y-chromosomal composition. Table F in S1 File. Groups of populations used in IBD analysis. Table G in S1 File. Summary statistics of IBD analysis. Table H in S1 File. Analysis of molecular variance (AMOVA) in Balto-Slavic populations. Table I in S1 File. Matrices of geographic (a), lexicostatistical (b) and genetic (c,d,e) distances between Balto-Slavic populations used in Mantel Tests. Table J in S1 File. Results for Mantel tests on genetic, lexicostatistical and geographic distances. Table K in S1 File. Frequencies of the NRY haplogroups in Balto-Slavic populations. Table L in S1 File. Frequencies of NRY haplogroups in 29 Balto-Slavic populations presented here for the first time. Table M in S1 File. Populations used in whole-genome SNP analyses. Table N in S1 File. Frequencies of the NRY haplogroups in non-Balto-Slavic populations of Europe.
(XLSX)
Click here for additional data file.(224K, xlsx)
S2 File
(Linguistics: Datasets; Methods; Results).
Fig A in S2 File. Geographical distribution of extant Slavic and East Baltic languages and dialects used in the study. Map was prepared by Yuri Koryakov. Fig B in S2 File. Dated phylogenetic tree of the Balto-Slavic lects produced by the StarlingNJ method from the multistate matrix (binary nodes only). Bootstrap values are shown near the nodes (not shown for stable nodes with bootstrap value ≥ 95%). Fig C in S2 File. Phylogenetic tree of the Balto-Slavic lects produced by the NJ method from the binary matrix in the SplitsTree4 software. Bootstrap values are shown near the nodes (not shown for stable nodes with bootstrap value ≥ 95%). Branch length reflects the relative rate of cognate replacement as suggested by SplitsTree4. The BioNJ method yields the same topology. Fig D in S2 File. Phylogenetic tree of the Balto-Slavic lects produced by the UPGMA method from the binary matrix in the SplitsTree4 software. Bootstrap values are shown near the nodes (not shown for stable nodes with bootstrap value ≥ 95%). Branch length reflects the relative rate of cognate replacement as suggested by SplitsTree4. Fig E in S2 File. Consensus phylogenetic tree of the Balto-Slavic lects produced by the Bayesian MCMC method from the binary matrix in the MrBayes software. Bayesian posterior probabilities are shown near the nodes (not shown for stable nodes with P ≥ 0.95). Branch length reflects the relative rate of cognate replacement as suggested by MrBayes. Fig F in S2 File. Optimal phylogenetic tree of the Balto-Slavic lects produced by the UMP method from the binary matrix in the TNT software. Bootstrap values are shown near the nodes (not shown for stable nodes with bootstrap value ≥ 95%). Branch length reflects the relative rate of cognate replacement as suggested by TNT. Fig G in
S2 File. Manually constructed consensus phylogenetic tree of the Balto-Slavic lects based on the StarlingNJ, NJ, BioNJ, UPGMA, Bayesian MCMC, UMP methods. Ternary nodes result from neighboring binary nodes, joined together, if the temporal distance between them ≤ 300 years. The gray ellipses additionally mark two joined nodes, which cover binary branchings that differ depending on the method. Probability values are shown in the following sequence: NJ/Bayesian MCMC/UMP (“x” means that P ≥ 0.95 in an individual method; not shown for nodes with P ≥ 0.95 in all methods). StarlingNJ dates are proposed. Fig H in S2 File. NeighborNet network of the Balto-Slavic lects (without Slovenian) + German. Produced in the SplitsTree4 software; bootstrap values are shown near the nodes (not shown for stable nodes with bootstrap value ≥ 95%). Fig I in S2 File. NeighborNet network of the Balto-Slavic lects (without Slovenian) + Demotic Greek. Produced in the SplitsTree4 software; bootstrap values are shown near the nodes (not shown for stable nodes with bootstrap value ≥ 95%). Fig J in S2 File. NeighborNet network of the Balto-Slavic lects (without Slovenian) + German + Demotic Greek. Produced in the SplitsTree4 software; bootstrap values are shown near the nodes (not shown for stable nodes with bootstrap value ≥ 95%). Fig K in S2 File. NeighborNet network of the Balto-Slavic lects (with Slovenian) + German. Produced in the SplitsTree4 software; bootstrap values are shown near the nodes (not shown for stable nodes with bootstrap value ≥ 95%). Fig L in S2 File. NeighborNet network of the Balto-Slavic lects (with Slovenian) + Demotic Greek. Produced in the SplitsTree4 software; bootstrap values are shown near the nodes (not shown for stable nodes with bootstrap value ≥ 95%). Fig M in S2 File. NeighborNet network of the Balto-Slavic lects (with Slovenian) + German + Demotic Greek. Produced in the SplitsTree4 software; bootstrap values are shown near the nodes (not shown for stable nodes with bootstrap value ≥ 95%).
(PDF)
Click here for additional data file.(1.5M, pdf)
S3 File
Table A in S3 File. Lexical dataset (multistate matrix with synonyms allowed). Table B in S3 File. Reverse distance matrix generated from the multistate matrix (Table A in S3 File) in the Starling software. Table C in S3 File. Distance matrix, generated from the binary matrix (bslav.nex (deposited in S1 Dataset)) in the SplitsTree4 software.
(XLSX)
Click here for additional data file.(58K, xlsx)
S1 Text
(Genetics: Datasets, Methods).
(DOCX)
Click here for additional data file.(59K, docx)
Acknowledgments
We are grateful to all the volunteers who have made this study possible by donating their blood samples. We thank V. Ferak, M. Nelis, J. Klovins, and A. Kouvatsi for assistance in sampling. We thank B. Browning for valuable discussion of results of the IBD analysis. We thank Yu. Koryakov for the geographical map of Baltic and Slavic languages distribution, designed especially for this study. Computational analyses of whole genome data were performed on High Performance Computing Center, University of Tartu, Estonia.
The members of the Genographic Consortium are: Li Jin, Hui Li, & Shilin Li (Fudan University, Shanghai, China); Pandikumar Swamikrishnan (IBM, Somers, New York, United States); Asif Javed, Laxmi Parida & Ajay K. Royyuru (IBM, Yorktown Heights, New York, United States); R. John Mitchell (La Trobe University, Melbourne, Victoria, Australia); Pierre A. Zalloua (Lebanese American University, Chouran, Beirut, Lebanon); Syama Adhikarla, Arun Kumar, GaneshPrasad, Ramasamy Pitchappan, Arun Varatharajan Santhakumari (Madurai Kamaraj University, Madurai, Tamil Nadu, India); R. Spencer Wells and Miguel G. Vilar (National Geographic Society, Washington, District of Columbia, United States); Himla Soodyall (National Health Laboratory Service, Johannesburg, South Africa); Elena Balanovska & Oleg Balanovsky (Research Centre for Medical Genetics, Russian Academy of Medical Sciences, Moscow, Russia); Chris Tyler-Smith (The Wellcome Trust Sanger Institute, Hinxton, United Kingdom); Daniela R. Lacerda & Fabrício R. Santos (Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil); Jaume Bertranpetit, Marc Haber & Marta Melé (Universitat Pompeu Fabra, Barcelona, Spain); Christina J. Adler, Alan Cooper, Clio S. I. Der Sarkissian & Wolfgang Haak (University of Adelaide, South Australia, Australia); Matthew E. Kaplan & Nirav C. Merchant (University of Arizona, Tucson, Arizona, United States); Colin Renfrew (University of Cambridge, Cambridge, United Kingdom); Andrew C. Clarke & Elizabeth A. Matisoo-Smith (University of Otago, Dunedin, New Zealand); Jill B. Gaieski, Amanda C. Owings & Theodore G. Schurr (University of Pennsylvania, Philadelphia, Pennsylvania, United States). A lead author of the Genographic Consortium is R. Spencer Wells (
gro.sgn@sllewps).
Funding Statement
This work was supported by Russian Science Foundation (grant 14-14-00827 to OB, M. Chuhryaeva, AA and VZ), Programme of the Presidium of Russian Academy of Sciences "Molecular and cell biology", Russian Foundation For Basic Research (grants 13-04-01711, 13-06-00670, 13-04-90420); Ukrainian State Fund for Fundamental Researches (grant F53.4/071); the European Union European Regional Development Fund through the Centre of Excellence in Genomics to the Estonian Biocentre; by the European Commission grant 205419 ECOGENE to the Estonian Biocentre; the Estonian Basic Research Grant SF 0270177s08 and by Institutional Research Funding to the Estonian Biocentre from the Estonian Research Council IUT24-1; the European Commission grant 205419 ECOGENE to the Estonian Biocentre; the Wellcome Trust 098051 to CTS; the Lithuanian part was supported by the LITGEN project (VP1-3.1-ŠMM-07-K-01-013), funded by the European Social Fund under the Global Grant Measure. Center for Genomics and Transcriptomics (CeGaT GmbH) provided support in the form of salaries for author LM, but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific roles of LM are articulated in the ‘author contributions’ section.
Data Availability
The whole genome SNP data generated in this study are available in the National Center for Biotechnology Information – Gene Expression Omnibus (NCBI GEO accession number GSE71049) as well as in PLINK format in our website at
www.ebc.ee/free_data. The NRY dataset is presented in Table N in
S1 File; mtDNA HVS1 sequences are available in the National Center for Biotechnology Information (GenBank accession numbers KT261802–KT262718).
References
1. Fortson Benjamin W. IV. Indo-European Language and Culture: An Introduction. Oxford: Blackwell; 2004. [
Google Scholar]
2. Mallory JP, Adams DQ. The Oxford introduction to Proto-Indo-European and the Proto-Indo-European world. Oxford: Oxford University Press; 2006. [
Google Scholar]
3. Rexová K, Frynta D, Zrzavý J. Cladistic analysis of languages: Indo-European classification based on lexicostatistical data. Cladistics. 2003;19: 120–127. 10.1111/j.1096-0031.2003.tb00299.x [
CrossRef] [
Google Scholar]
4. Ringe D, Warnow T, Taylor A. Indo-European and Computational Cladistics. Trans Philol Soc. 2002;100: 59–129. 10.1111/1467-968X.00091 [
CrossRef] [
Google Scholar]
5. Nakhleh L, Warnow T, Ringe D, Evans SN. A comparison of phylogenetic reconstruction methods on an Indo-European dataset. Trans Philol Soc. 2005;103: 171–192. 10.1111/j.1467-968X.2005.00149.x [
CrossRef] [
Google Scholar]
6. Novotná P, Blažek V. Glottochronology and its application to the Balto-Slavic languages. Baltistica 42/2: 185–210; Baltistica 42/3: 323–346. Baltistica. 2007;42: 323–346. [
Google Scholar]
7. Gray RD, Atkinson QD. Language-tree divergence times support the Anatolian theory of Indo-European origin. Nature. 2003;426: 435–439. 10.1038/nature02029 [
PubMed] [
CrossRef] [
Google Scholar]
8. Bouckaert R, Lemey P, Dunn M, Greenhill SJ, Alekseyenko AV, Drummond AJ, et al. Mapping the origins and expansion of the Indo-European language family. Science. 2012;337: 957–960. 10.1126/science.1219669 [
PMC free article] [
PubMed] [
CrossRef] [
Google Scholar]
9. Schleicher A. Compendium der vergleichenden Grammatik der indogermanischen Sprachen. Weimar: H. Böhlau; 1861. [
Google Scholar]
10. Henrik Birnbaum. Common Slavic: progress and problems in its reconstruction Cambridge Mass: Slavica; 1975. [
Google Scholar]
11. Sussex R, Cubberley P. The Slavic Languages (Cambridge Language Surveys). Cambridge University Press; 2006. [
Google Scholar]
12. Blažek V. From August Schleicher to Sergei Starostin. On the development of the tree-diagram models of the Indo-European languages. 2007;35 Available:
http://www.muni.cz/research/publications/725608 [
Google Scholar]
13. Schafarik PJ. Slawische Alterthümer. Leipzig: Wilhelm Engelmann; 1843. [
Google Scholar]
14. Pogodin AL. Iz istorii slavyanskikh peredvizhenij [History of Slavic studies]. Moskva: tip. Lopukhina; 1901. [
Google Scholar]
15. Rostafiński Józef. O pierwotnych siedzibach i gospodarstwie Słowian w przedhistorycznych czasach Nakł. Akademii Umiejętności; 1908. [
Google Scholar]
16. Sedov VV. Proishozhdenie I rannyaya istoriya slavian [Origin and early history of Slavs]. Moskva: Nauka; 1979. [
Google Scholar]
17. Barford P.M. The Early Slavs: Culture and Society in Early Medieval Eastern Europe. 1st ed. Cornell University Press; 2001. [
Google Scholar]
18. Curta F. The Making of the Slavs: History and Archaeology of the Lower Danube Region. Cambridge University Press; 2001. [
Google Scholar]
19. Heather P. Empires and barbarians. The fall of Rome and the birth of Europe. Oxford: Oxford University Press; 2010. [
Google Scholar]
20. Manco J. Ancestral Journeys: The Peopling of Europe from the First Venturers to the Vikings. 1 edition Thames & Hudson; 2013. [
Google Scholar]
21. Sedov VV. Slaviane: Istoriko-arheologicheskoe issledovanie [Slavs: Historical and archaeological study]. Moskva: Yazyki slavianskoi kultury; 2002. [
Google Scholar]
22. Rosser ZH, Zerjal T, Hurles ME, Adojaan M, Alavantic D, Amorim A, et al. Y-chromosomal diversity in Europe is clinal and influenced primarily by geography, rather than by language. Am J Hum Genet. 2000;67: 1526–1543. 10.1086/316890 [
PMC free article] [
PubMed] [
CrossRef] [
Google Scholar]
23. Semino O, Passarino G, Oefner PJ, Lin AA, Arbuzova S, Beckman LE, et al. The genetic legacy of Paleolithic Homo sapiens sapiens in extant Europeans: a Y chromosome perspective. Science. 2000;290: 1155–1159. [
PubMed] [
Google Scholar]
24. Pericić M, Lauc LB, Klarić IM, Rootsi S, Janićijevic B, Rudan I, et al. High-resolution phylogenetic analysis of southeastern Europe traces major episodes of paternal gene flow among Slavic populations. Mol Biol Evol. 2005;22: 1964–1975. 10.1093/molbev/msi185 [
PubMed] [
CrossRef] [
Google Scholar]
25. Kasperaviciūte D, Kucinskas V. Variability of the human mitochondrial DNA control region sequences in the Lithuanian population. J Appl Genet. 2002;43: 255–260. [
PubMed] [
Google Scholar]
26. Kasperaviciūte D, Kucinskas V, Stoneking M. Y chromosome and mitochondrial DNA variation in Lithuanians. Ann Hum Genet. 2004;68: 438–452. 10.1046/j.1529-8817.2003.00119.x [
PubMed] [
CrossRef] [
Google Scholar]
27. Woźniak M, Malyarchuk B, Derenko M, Vanecek T, Lazur J, Gomolcak P, et al. Similarities and distinctions in Y chromosome gene pool of Western Slavs. Am J Phys Anthropol. 2010;142: 540–548. 10.1002/ajpa.21253 [
PubMed] [
CrossRef] [
Google Scholar]
28. Mielnik-Sikorska M, Daca P, Malyarchuk B, Derenko M, Skonieczna K, Perkova M, et al. The history of Slavs inferred from complete mitochondrial genome sequences. PloS One. 2013;8: e54360 10.1371/journal.pone.0054360 [
PMC free article] [
PubMed] [
CrossRef] [
Google Scholar]
29. Underhill PA, Poznik GD, Rootsi S, Järve M, Lin AA, Wang J, et al. The phylogenetic and geographic structure of Y-chromosome haplogroup R1a. Eur J Hum Genet. 2015;23: 124–131. 10.1038/ejhg.2014.50 [
PMC free article] [
PubMed] [
CrossRef] [
Google Scholar]
30. Malyarchuk BA, Grzybowski T, Derenko MV, Czarny J, Woźniak M, Miścicka-Sliwka D. Mitochondrial DNA variability in Poles and Russians. Ann Hum Genet. 2002;66: 261–283. 10.1017/S0003480002001161 [
PubMed] [
CrossRef] [
Google Scholar]
31. Malyarchuk BA, Grzybowski T, Derenko MV, Czarny J, Drobnic K, Miścicka-Sliwka D. Mitochondrial DNA variability in Bosnians and Slovenians. Ann Hum Genet. 2003;67: 412–425. [
PubMed] [
Google Scholar]
32. Morozova I, Evsyukov A, Kon’kov A, Grosheva A, Zhukova O, Rychkov S. Russian ethnic history inferred from mitochondrial DNA diversity. Am J Phys Anthropol. 2012;147: 341–351. 10.1002/ajpa.21649 [
PubMed] [
CrossRef] [
Google Scholar]
33. Grzybowski T, Malyarchuk BA, Derenko MV, Perkova MA, Bednarek J, Woźniak M. Complex interactions of the Eastern and Western Slavic populations with other European groups as revealed by mitochondrial DNA analysis. Forensic Sci Int Genet. 2007;1: 141–147. 10.1016/j.fsigen.2007.01.010 [
PubMed] [
CrossRef] [
Google Scholar]
34. Karachanak S, Carossa V, Nesheva D, Olivieri A, Pala M, Hooshiar Kashani B, et al. Bulgarians vs the other European populations: a mitochondrial DNA perspective. Int J Legal Med. 2012;126: 497–503. 10.1007/s00414-011-0589-y [
PubMed] [
CrossRef] [
Google Scholar]
35. Karmin M, Saag L, Vicente M, Sayres MAW, Järve M, Talas UG, et al. A recent bottleneck of Y chromosome diversity coincides with a global change in culture. Genome Res. 2015; 10.1101/gr.186684.114 [
PMC free article] [
PubMed] [
CrossRef] [
Google Scholar]
36. Malyarchuk B, Grzybowski T, Derenko M, Perkova M, Vanecek T, Lazur J, et al. Mitochondrial DNA Phylogeny in Eastern and Western Slavs. Mol Biol Evol. 2008;25: 1651–1658. 10.1093/molbev/msn114 [
PubMed] [
CrossRef] [
Google Scholar]
37. Malyarchuk B, Derenko M, Grzybowski T, Perkova M, Rogalla U, Vanecek T, et al. The Peopling of Europe from the Mitochondrial Haplogroup U5 Perspective. PLoS ONE. 2010;5: e10285 10.1371/journal.pone.0010285 [
PMC free article] [
PubMed] [
CrossRef] [
Google Scholar]
38. Juras A, Dabert M, Kushniarevich A, Malmström H, Raghavan M, Kosicki JZ, et al. Ancient DNA Reveals Matrilineal Continuity in Present-Day Poland over the Last Two Millennia. PLoS ONE. 2014;9: e110839 10.1371/journal.pone.0110839 [
PMC free article] [
PubMed] [
CrossRef] [
Google Scholar]
39. Ralph P, Coop G. The Geography of Recent Genetic Ancestry across Europe. PLoS Biol. 2013;11: e1001555 10.1371/journal.pbio.1001555 [
PMC free article] [
PubMed] [
CrossRef] [
Google Scholar]
40. Hellenthal G, Busby GBJ, Band G, Wilson JF, Capelli C, Falush D, et al. A Genetic Atlas of Human Admixture History. Science. 2014;343: 747–751. 10.1126/science.1243518 [
PMC free article] [
PubMed] [
CrossRef] [
Google Scholar]
41. Lao O, Lu TT, Nothnagel M, Junge O, Freitag-Wolf S, Caliebe A, et al. Correlation between genetic and geographic structure in Europe. Curr Biol CB. 2008;18: 1241–1248. 10.1016/j.cub.2008.07.049 [
PubMed] [
CrossRef] [
Google Scholar]
42. Behar DM, Yunusbayev B, Metspalu M, Metspalu E, Rosset S, Parik J, et al. The genome-wide structure of the Jewish people. Nature. 2010;466: 238–242. 10.1038/nature09103 [
PubMed] [
CrossRef] [
Google Scholar]
43. Veeramah KR, Tönjes A, Kovacs P, Gross A, Wegmann D, Geary P, et al. Genetic variation in the Sorbs of eastern Germany in the context of broader European genetic diversity. Eur J Hum Genet EJHG. 2011;19: 995–1001. 10.1038/ejhg.2011.65 [
PMC free article] [
PubMed] [
CrossRef] [
Google Scholar]
44. Yunusbayev B, Metspalu M, Järve M, Kutuev I, Rootsi S, Metspalu E, et al. The Caucasus as an asymmetric semipermeable barrier to ancient human migrations. Mol Biol Evol. 2012;29: 359–365. 10.1093/molbev/msr221 [
PubMed] [
CrossRef] [
Google Scholar]
45. Khrunin AV, Khokhrin DV, Filippova IN, Esko T, Nelis M, Bebyakova NA, et al. A genome-wide analysis of populations from European Russia reveals a new pole of genetic diversity in northern Europe. PloS One. 2013;8: e58552 10.1371/journal.pone.0058552 [
PMC free article] [
PubMed] [
CrossRef] [
Google Scholar]
46. Behar D, Metspalu M, Baran Y, Kopelman N, Yunusbayev B, Gladstein A, et al. No Evidence from Genome-Wide Data of a Khazar Origin for the Ashkenazi Jews. Hum Biol Open Access Pre-Prints. 2013; Available:
http://digitalcommons.wayne.edu/humbiol_preprints/41 [
PubMed]
47. Yunusbayev B, Metspalu M. Metspalu E, Valeev A, Litvinov S, Valiev R, et al. The Genetic Legacy of the Expansion of Turkic-Speaking Nomads Across Eurasia. PlosGenet. 2015; 10.1101/005850 [
PMC free article] [
PubMed] [
CrossRef] [
Google Scholar]
48. Lazaridis I, Patterson N, Mittnik A, Renaud G, Mallick S, Kirsanow K, et al. Ancient human genomes suggest three ancestral populations for present-day Europeans. Nature. 2014;513: 409–413. 10.1038/nature13673 [
PMC free article] [
PubMed] [
CrossRef] [
Google Scholar]
49. Toporov VN, Trubachev ON. Lingvisticheskij analiz gidronimov Verkhnego Podneprov’ya [Linguistic study of hydronyms of Upper Dnieper]. Moskva: Akademiya Nauk SSSR; 1962. [
Google Scholar]
50. Sedov VV. Vostochnye slaviane v VI-XIII vv [East Slavs in 6–8 cc AD]. Moskva: Nauka; 1982. [
Google Scholar]
51. Alexander DH, Novembre J, Lange K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 2009;19: 1655–1664. 10.1101/gr.094052.109 [
PMC free article] [
PubMed] [
CrossRef] [
Google Scholar]
52. Palamara PF, Lencz T, Darvasi A, Pe’er I. Length distributions of identity by descent reveal fine-scale demographic history. Am J Hum Genet. 2012;91: 809–822. 10.1016/j.ajhg.2012.08.030 [
PMC free article] [
PubMed] [
CrossRef] [
Google Scholar]
53. Browning BL, Browning SR. A fast, powerful method for detecting identity by descent. Am J Hum Genet. 2011;88: 173–182. 10.1016/j.ajhg.2011.01.010 [
PMC free article] [
PubMed] [
CrossRef] [
Google Scholar]
54. Wichmann S, Brown CH, Holman EW, editors. ASJP [Internet]. Leipzig: Max Planck Institute for Evolutionary Anthropology; 2014. Available:
http://asjp.clld.org/
55. Kovacevic L, Tambets K, Ilumäe A-M, Kushniarevich A, Yunusbayev B, Solnik A, et al. Standing at the Gateway to Europe—The Genetic Structure of Western Balkan Populations Based on Autosomal and Haploid Markers. PLoS ONE. 2014;9: e105090 10.1371/journal.pone.0105090 [
PMC free article] [
PubMed] [
CrossRef] [
Google Scholar]
56. Balanovsky O, Rootsi S, Pshenichnov A, Kivisild T, Churnosov M, Evseeva I, et al. Two sources of the Russian patrilineal heritage in their Eurasian context. Am J Hum Genet. 2008;82: 236–250. 10.1016/j.ajhg.2007.09.019 [
PMC free article] [
PubMed] [
CrossRef] [
Google Scholar]
57. Rebała K, Mikulich AI, Tsybovsky IS, Siváková D, Dzupinková Z, Szczerkowska-Dobosz A, et al. Y-STR variation among Slavs: evidence for the Slavic homeland in the middle Dnieper basin. J Hum Genet. 2007;52: 406–414. 10.1007/s10038-007-0125-6 [
PubMed] [
CrossRef] [
Google Scholar]
58. Luca F, Di Giacomo F, Benincasa T, Popa LO, Banyko J, Kracmarova A, et al. Y-chromosomal variation in the Czech Republic. Am J Phys Anthropol. 2007;132: 132–139. 10.1002/ajpa.20500 [
PubMed] [
CrossRef] [
Google Scholar]
59. Kayser M, Lao O, Anslinger K, Augustin C, Bargel G, Edelmann J, et al. Significant genetic differentiation between Poland and Germany follows present-day political borders, as revealed by Y-chromosome analysis. Hum Genet. 2005;117: 428–443. 10.1007/s00439-005-1333-9 [
PubMed] [
CrossRef] [
Google Scholar]
60. Rębała K, Martínez-Cruz B, Tönjes A, Kovacs P, Stumvoll M, Lindner I, et al., Genographic Consortium. Contemporary paternal genetic landscape of Polish and German populations: from early medieval Slavic expansion to post-World War II resettlements. Eur J Hum Genet EJHG. 2013;21: 415–422. 10.1038/ejhg.2012.190 [
PMC free article] [
PubMed] [
CrossRef] [
Google Scholar]
61. Bunak VV. Proiskhozhdenie i etnicheskaya istoriya russkogo naroda po antropologicheskim dannym [Origin and ethnic history of Russians from anthropological data]. Moskva: Nauka; 1965. [
Google Scholar]
62. Alekseeva T. Vostochnye Slaviane. Antropologiya i etnicheskaya istoriya [East Slavs. Anthropology and Ethnic history]. Moskva: Nauchnyi Mir; 2011. [
Google Scholar]
63. Balanovska EV, Pezhemski DV, Romanov AG, Baranova EE, Romashkina MV, Agdzhoyan AT, et al. Genofond Russkogo Severa: Slaviane? Finny? Paleoevropeitsy? [Gene pool of Russian north:Slavs? Finns? Paleoeuropeans?]. Vestn Mosk Universtiteta. 2011; 27–58.
64. Nelis M, Esko T, Mägi R, Zimprich F, Zimprich A, Toncheva D, et al. Genetic Structure of Europeans: A View from the North–East. PLoS ONE. 2009;4: e5472 10.1371/journal.pone.0005472 [
PMC free article] [
PubMed] [
CrossRef] [
Google Scholar]
65. Rootsi S, Zhivotovsky LA, Baldovic M, Kayser M, Kutuev IA, Khusainova R, et al. A counter-clockwise northern route of the Y-chromosome haplogroup N from Southeast Asia towards Europe. Eur J Hum Genet EJHG. 2007;15: 204–211. 10.1038/sj.ejhg.5201748 [
PubMed] [
CrossRef] [
Google Scholar]
66. Kushniarevich A, Sivitskaya L, Danilenko N, Novogrodskii T, Tsybovsky I, Kiseleva A, et al. Uniparental genetic heritage of Belarusians: encounter of rare Middle Eastern matrilineages with a Central European mitochondrial DNA pool. PloS One. 2013;8: e66499 10.1371/journal.pone.0066499 [
PMC free article] [
PubMed] [
CrossRef] [
Google Scholar]
67. Cavalli-Sforza LL, Menozzi P, Piazza A. Demic expansions and human evolution. Science. 1993;259: 639–646. 10.1126/science.8430313 [
PubMed] [
CrossRef] [
Google Scholar]
68. Balanovsky O, Dibirova K, Dybo A, Mudrak O, Frolova S, Pocheshkhova E, et al., Genographic Consortium. Parallel evolution of genes and languages in the Caucasus region. Mol Biol Evol. 2011;28: 2905–2920. 10.1093/molbev/msr126 [
PMC free article] [
PubMed] [
CrossRef] [
Google Scholar]
69. Van Oven M, Kayser M. Updated comprehensive phylogenetic tree of global human mitochondrial DNA variation. Hum Mutat. 2009;30: E386–394. 10.1002/humu.20921 [
PubMed] [
CrossRef] [
Google Scholar]
70. Nei M. Genetic distance between populations. Am Nat. 1972;106: 283–92. [
Google Scholar]
71. Patterson N, Price AL, Reich D. Population structure and eigenanalysis. PLoS Genet. 2006;2: e190 10.1371/journal.pgen.0020190 [
PMC free article] [
PubMed] [
CrossRef] [
Google Scholar]
72. Excoffier L, Laval G, Schneider S. Arlequin (version 3.0): an integrated software package for population genetics data analysis. Evol Bioinforma Online. 2005;1: 47–50. [
PMC free article] [
PubMed] [
Google Scholar]
73. Starostin SA. Opredelenie ustojchivosti bazisnoj leksiki [Defining the stability of basic lexicon]. Trudy po yazykoznaniyu. 2007. pp. 827–839.
74. Starostin G. Preliminary Lexicostatistics as a Basis for Language Classification: a New Approach. J Lang Relatsh. 2010;3: 79–116. [
Google Scholar]
75. Koshel SM. Geoinformation technologies in genegeography. Mod Geogr Cartogr Artic Collect Ed IK Lure VI Kravtsova. 2012; 158–66.
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4558026/
@Ројалиста @Kyrios @Krishna @FDDD @Kor @Lanselot @Miki 75 @suave poznat ovaj rad?