Generic scripts
- Working with data sets
- Use of SQL queries
- Text mining to identify relations between documents (to be executed maintaining the order of the next points)
- Create a data set with the information on the files stored in a directory
- Create the term document matrix
- Analyze the term-document matrix with the lexical correspondence analysis
- Plot the results of the lexical correspondence analysis
- Text mining and sentiment analysis for italian texts (to be executed maintaining the order of the next points)
- Create a data set with the information on the files stored in a directory
- Read the content of the documents in a data set
- Transforms the documents in a list of words, considering also their sentences
- Rebuild the content of the document adding the reference to the sentences
- Lemmatization and selection of specific types
- Rebuild the sentences with the selected lemmas
- Evaluate the sentiment for each sentence
- Web mining