Sorry, your browser is not supported
To have the best experience using Skillshare, we recommend that you use one of these supported browsers.
Play Speed
  • 0.5x
  • 1x (Normal)
  • 1.25x
  • 1.5x
  • 2x
15 Videos (37m)
    • Class introduction

    • Exploring US University Ranks with Bash Shell (Data Preview)

    • Uniranks: Find the colleges in the ranklist (grep, pipe and wc)

    • Uniranks: Find the number of Institutes from a given and all states (cut and sort)

    • Uniranks: Finding the correlation between university tuition and ranks (tail and redirect)

    • Demonstration (Exploring US Uniranks)

    • Facebook data mining - Data preview (head command)

    • Facebook Data: Find the number of status and most popular status entry (cut, sort, grep, awk)

    • Facebook Data: Building a function to find the most vibrant Facebook status (Bash functions)

    • Facebook Data Mining (Demonstration)

    • Data mining Australian cities crime statistics (Data Preview)

    • AU Data: Finding rows and columns stats (wc, sed, csvstat)

    • AU Data: Finding the top most crime per city (awk)

    • Au Data: Finding the best city in Australia (Bash shell programming)

    • AU Crime Data Mining (Demonstration)


About This Class


This beginner friendly course is specifically designed to show you how to use Bash commands and shell programming to handle textual data which can be a csv format data or systems log file. In this course you will  learn Bash by doing projects. 

However, you need to understand the fact that Bash may not the best way to handle all kinds of data! But there often comes a time when you are provided with a pure Bash environment, such as what you get in the common Linux based Super-computers and you just want an early result or view of the data before you drive into the real programming, using Python, R and SQL, SPSS, and so on. Expertise in these data-intensive languages also comes at the price of spending a lot of time on them.

In contrast, bash scripting is simple, easy to learn and perfect for mining textual data! Particularly if you deal with genomics, microarrays, social networks, life sciences, and so on. It can help you to quickly sort, search, match, replace, clean and optimise various aspect of your data, and you wouldn’t need to go through any tough learning curves. We strongly believe, learning and using Bash shell scripting should be the first step if you want to say, Hello Big Data!

Authored by Ahmed Arefin, PhD and thankfully voiced by A. Collinwood (voice artist). This course is a core component of the 'Learn Scientific Programming' initiative.






Reviews (0)

Ahmed Arefin, PhD

Computation Scientist

Ahmed Arefin, PhD is an enthusiastic computer programmer with more than a decade of well-rounded computational experience. He likes to code, but loves to write, research and teach. He founded the Learn Scientific Programming initiative to teach advanced programming topics in kindergarten-silly approaches.

Following a PhD and Postdoc research in the area of data-parallelism he's moved forward to become a Scientific Computing professional, keeping his research interests on, in the area o...

See full profile