• Modeling word trees in historical linguistics: Preliminary ideas for the reconciliation of word trees and language trees

    Author(s):
    Johann-Mattis List, Nathanael Erik Schweikhard (see profile)
    Date:
    2020
    Subject(s):
    Computational linguistics, Data curation, Etymology, Historical linguistics, Morphology
    Item Type:
    Article
    Tag(s):
    Indo-European, linguistic data, Networks, tree reconciliation, word trees
    Permanent URL:
    http://dx.doi.org/10.17613/8h49-rp11
    Abstract:
    Linguists have acknowledged for a long time that words have their own history that does not necessarily fully align with the history of the languages as a whole. Recent approaches in computational historical linguistics have started integrating this fact into their models of language change, yet without modelling linguistic processes like word formation in detail. Inspired by advances in evolutionary biology, we propose a framework for annotating etymological relationships within word families as word trees to make this kind of data available for quantitative and qualitative studies. We test our dataset on a data sample of etymologies and include a small selection of Python scripts that enable checking the annotation for consistency and deriving basic statistics.
    Notes:
    To be published after revision in the Sektionsband Historische Linguistik, the proceedings of the GeSuS Conference Warszawa 2019.
    Metadata:
    Status:
    Published
    Last Updated:
    6 months ago
    License:
    All Rights Reserved
    Share this:

    Downloads

    Item Name:pdf word_trees.pdf
     Download View in browser
    Activity: Downloads: 92