Open-access materials

Resources

Free tutorials, R scripts, data sets, and course materials  ·  All open-access via LADAL and GitHub

Tutorials

Below are links to tutorials I created for the Language Technology and Data Analysis Laboratory (LADAL).

DATA SCIENCE BASICS

INTRODUCTION TO R

DATA VISUALIZATION

STATISTICS

TEXT ANALYTICS / TEXT MINING / CORPUS LINGUISTICS

CASE STUDIES / FOCUS TUTORIALS

  • Creating vowel charts with Praat and R: This LADAL tutorial shows how to extract formant values in Praat and use these to create a vowel chart in R.
  • Corpus Linguistics: Gender and Age Differences in Swearing: This LADAL tutorial exemplifies how to perform a simple corpus analysis focusing on gender and age differences in swear word use in Irish English.
  • PDF to txt: This LADAL tutorial shows how to extract text from PDF files into txt files for further processing.

For Students

General Notes for Students attending my Courses (Merkblatt für Seminare)
You will find a document with general information about my seminars here. Please read this document if you are attending or plan to attend one of my seminars! (last updated 2015/02/16)

Model term paper
You will find a model term paper here. This model term paper includes information about the structure, content, and formatting of term papers. You can also use it as a template and use the formatting within the model. (last updated 2015/04/08)


Programming / Software Development / Corpus Linguistics

Below you can find some resources such as scripts and data sets that you may find useful.

R scripts

  • Chi Squared test for subtables of 2*k tables (R script)
  • Configural Frequency Analysis for data with only two level configurations (R script)
  • Function for downloading text from websites to create web corpora (R script)
  • Function providing nice summaries of simple linear regressions (R script)
  • Function providing nice summaries of multiple linear regressions (R script)
  • Function providing nice summaries of fixed-effects binomial logistic regressions (R script)
  • Step-wise step-up model fitting of fixed-effects binomial logistic regressions (R script)
  • Step-wise step-up model fitting of mixed-effects binomial logistic regressions (R script)
  • Step-wise step-down model fitting of mixed-effects binomial logistic regressions (R script)

Biodata scripts & data sets (last updated 2015/02/09)

If you find any bugs in the code or mistakes in the results, please let me know.

TestCorpus
A small sample corpus for testing functions.

(last updated 2024/06)

Back to top