On the shape of semantic space - what can we infer from large-scale statistical properties of texts?


dc.contributor San Miguel Ruibal, Maximino
dc.contributor.author Czégel, Dániel
dc.date 2017
dc.date.accessioned 2018-05-25T11:26:36Z
dc.date.issued 2018-05-25
dc.identifier.uri http://hdl.handle.net/11201/146236
dc.description.abstract [eng] The large amount of digitized linguistic data opens up the unique possibility of using the methodology of complex systems to understand high-level human cognitive processes. Two such issues are i) the way we categorize the continuous space of real-world features into discrete concepts, and ii) the way we use language to copy a line of thought from one brain to another. In this work I address both questions by formulating a simple text generation model which reproduces the three major characteristic large-scale statistical laws of human language streams, namely Zipf’s law, Heaps’ law and burstiness. Furthermore, the generation itself can be described as a random walk on a scale-free, highly clustered and low-dimensional complex network, suggesting that this class of networks is appropriate as a minimal model of the semantic space. Disentangling the global characteristics of the semantic space is a necessary step towards analyzing texts as trajectories in such a space, with promising applications such as author or style identification, personal disorder diagnosis, or the evolution of cultural traits mirrored by text production characteristics. ca
dc.format application/pdf
dc.language.iso eng ca
dc.publisher Universitat de les Illes Balears
dc.rights info:eu-repo/semantics/openAccess
dc.rights all rights reserved
dc.subject 53 - Física ca
dc.title On the shape of semantic space - what can we infer from large-scale statistical properties of texts? ca
dc.type info:eu-repo/semantics/masterThesis ca
dc.type info:eu-repo/semantics/publishedVersion
dc.date.updated 2018-05-22T09:10:37Z
dc.date.embargoEndDate info:eu-repo/date/embargoEnd/2050-01-01
dc.embargo 2050-01-01
dc.rights.accessRights info:eu-repo/semantics/embargoedAccess


