Applied Data Science - 4 : Data Engineering

Kumaran Ponnambalam, Dedicated to Data Science Education

6 Lessons (1h 12m)
  1. About Applied Data Science Series (8:12)
  2. Data Acquisition (16:01)
  3. Data Cleansing (10:50)
  4. Data Transformations (11:09)
  5. Text Pre Processing TF IDF (14:53)
  6. R Examples for Data Engineering (11:14)

Project Description

Text Pre-processing for news items

1. From the internet, collect a set of 10 articles/news items.

2. Load them as documents into a dataset.

3. Create a corpus using the R tm package.

4. Perform the following text pre-processing steps:

  • Remove punctuation
  • Remove stop words
  • Convert all text to lower case
  • Remove extra white space

5. Submit the resulting code as project output.
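
The steps above can be sketched in R roughly as follows. This is a minimal illustration, not the course's reference solution. The two sample strings are hypothetical stand-ins for the 10 collected articles, and the English stop word list is an assumption. The order of the transformations is also a choice made here: text is lower-cased first because the tm stop word list is lower case, and stop words are removed before punctuation so that contractions such as "don't" still match the list.

```r
library(tm)

# Hypothetical sample documents; replace with the 10 collected articles.
articles <- c(
  "The Quick brown fox, jumps over the lazy dog!",
  "Data   engineering is the first step; cleansing follows."
)

# Steps 2 and 3: load the documents and build a corpus.
corpus <- VCorpus(VectorSource(articles))

# Step 4: text pre-processing.
corpus <- tm_map(corpus, content_transformer(tolower))       # make all lower case
corpus <- tm_map(corpus, removeWords, stopwords("english"))  # remove stop words
corpus <- tm_map(corpus, removePunctuation)                  # remove punctuation
corpus <- tm_map(corpus, stripWhitespace)                    # collapse repeated white space

# Pull the cleaned text back out as a character vector.
# trimws() drops the leading/trailing space that stripWhitespace leaves behind.
cleaned <- vapply(
  seq_along(corpus),
  function(i) trimws(paste(content(corpus[[i]]), collapse = " ")),
  character(1)
)

print(cleaned)
```

For articles saved as text files, `VCorpus(DirSource("articles/"))` could replace the `VectorSource` call; the directory name here is only an example.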
