To use these functions, you need to install R. You may also want to install the free version of RStudio, as it it a little more user-friendly.
R is a free software environment for statistical computing and graphics. It compiles and runs on a wide variety of UNIX platforms, Windows and MacOS. https://cran.rstudio.com/
RStudio is a set of integrated tools designed to help you be more productive with R. It includes a console, syntax-highlighting editor that supports direct code execution, and a variety of robust tools for plotting, viewing history, debugging and managing your workspace. https://rstudio.com/products/rstudio/download/#download
If you’ve never used R before, or haven’t installed the following packages, you’ll need them to run the function
install.packages(c("tidyverse", "here", "readr", "rio"))
you want to have the file “paragraph_transcript.R” in the same file where you’re running the script
You’ll also want to set the input filename (the name of the transcript in a .txt format), and the output file name (what you want the output file to be called). You also want to set the “path” to where the input and output files will be. here can be a
source("paragraph_transcript.R")
input_filename <- "your transcript.txt"
output_filename <- "your transcript_processed.txt"
path <- "yourfilepath"
setwd(path)
paragraph_transcript(path, input_filename, output_filename)
There will be an error message if the speaker is missing that says “Check row(s) N and check name”.
You should add the name using a Speaker: text format
Once you’ve made those edits, you can re-run the code
Otherwise it will say [1] “No errors”, and you should see your output file in the specified computer directory.
Yayy! You can use your text processor of choice now (e.g. MS Word) to open up your output .txt file and edit your transcript for accuracy.
Hint: You can use Ctrl + A to highlight all of the text, and apply a preset format to make the document look even better