Notebook Five | Repository

Semantic Networks

Andrea Leone
University of Trento
February 2022


Load the pipeline


Process all transcripts


Semantic distribution


Let's start with the token "creativity"


Distribution of the tagged talks in the years




Define the dataset, split for the two decades: 2002-2012 and 2012-2022


Define the logic to analyse the texts in the dataset: get the lemmas and collect the similarity score with the focused tag


The analysis extracts a particular part-of-speech role; in this case, all nouns (suggested alternative: verbs)


Use the sorted results of the analysis to create a graph for each decade


Semantic tag distribution


Aggregate the relationship among tags


Like before, create a graph from the results for both decades


Plot the zipfian word count distribution