

You have already added 0 works in your ORCID record related to the merged Research product.
You have already added 0 works in your ORCID record related to the merged Research product.
<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=undefined&type=result"></script>');
-->
</script>
You have already added 0 works in your ORCID record related to the merged Research product.
You have already added 0 works in your ORCID record related to the merged Research product.
Euclidean distance between syntactically linked words

We study the Euclidean distance between syntactically linked words in sentences. The average distance is significantly small and is a very slowly growing function of sentence length. We consider two nonexcluding hypotheses: (a) the average distance is minimized and (b) the average distance is constrained. Support for (a) comes from the significantly small average distance real sentences achieve. The strength of the minimization hypothesis decreases with the length of the sentence. Support for (b) comes from the very slow growth of the average distance versus sentence length. Furthermore, (b) predicts, under ideal conditions, an exponential distribution of the distance between linked words, a trend that can be identified in real sentences. Peer Reviewed
Microsoft Academic Graph classification: Euclidean distance matrix Combinatorics Jaro–Winkler distance Distance from a point to a line Mathematics Discrete mathematics Mahalanobis distance Minkowski distance Euclidean distance Distance matrix Total variation distance of probability measures
arXiv: Computer Science::Computation and Language (Computational Linguistics and Natural Language and Speech Processing)
Statistics as Topic, Computational linguistics, Pattern Recognition, Automated, Artificial Intelligence, Terminology as Topic, Nonexcluding hypotheses, Natural Language Processing, Average distance, Syntactically linked words, Semantics, Vocabulary, Controlled, Sentence length, Linear Models, Lingüística computacional, Euclidean distance, :Informàtica::Intel·ligència artificial::Llenguatge natural [Àrees temàtiques de la UPC]
Statistics as Topic, Computational linguistics, Pattern Recognition, Automated, Artificial Intelligence, Terminology as Topic, Nonexcluding hypotheses, Natural Language Processing, Average distance, Syntactically linked words, Semantics, Vocabulary, Controlled, Sentence length, Linear Models, Lingüística computacional, Euclidean distance, :Informàtica::Intel·ligència artificial::Llenguatge natural [Àrees temàtiques de la UPC]
Microsoft Academic Graph classification: Euclidean distance matrix Combinatorics Jaro–Winkler distance Distance from a point to a line Mathematics Discrete mathematics Mahalanobis distance Minkowski distance Euclidean distance Distance matrix Total variation distance of probability measures
arXiv: Computer Science::Computation and Language (Computational Linguistics and Natural Language and Speech Processing)
citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).91 popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.Top 10% influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).Top 10% impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.Top 10% visibility views 83 download downloads 447 citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).91 popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.Top 10% influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).Top 10% impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.Top 10% Powered byBIP!
- 83views447downloads



We study the Euclidean distance between syntactically linked words in sentences. The average distance is significantly small and is a very slowly growing function of sentence length. We consider two nonexcluding hypotheses: (a) the average distance is minimized and (b) the average distance is constrained. Support for (a) comes from the significantly small average distance real sentences achieve. The strength of the minimization hypothesis decreases with the length of the sentence. Support for (b) comes from the very slow growth of the average distance versus sentence length. Furthermore, (b) predicts, under ideal conditions, an exponential distribution of the distance between linked words, a trend that can be identified in real sentences. Peer Reviewed