Skip to content

Latest commit

 

History

History
55 lines (34 loc) · 1.57 KB

grep.md

File metadata and controls

55 lines (34 loc) · 1.57 KB

Searching text data

So we've cleaned our data set, but how do we find entries that use a particular term?

Let's say I want to find all the entries in our data set that use the term "Paris."

$ cat nypl_items.csv | grep -i "paris"
[...]

This will print out all the lines that contain the word "Paris." (The -i flag makes the command ignore capitalization.) Let's use our wc -l command to see how many lines that is:

$ $ cat nypl_items.csv | grep -i "paris" | wc -l
191

Challenge

Use the grep command to explore our .csv file a bit. What areas are best covered by the data set?

Before we finish...

Before you leave today, we're going to prepare a little for our upcoming sessions. In your projects folder on the desktop, we're going to create a folder to house our cheat sheets for the week, as well as a new folder for the upcoming databases workshop.

$ pwd
/Users/patrick/Desktop/projects
$ mkdir cheatsheets
$ mkdir databases

Then move your cheat-sheet.txt file into your cheatsheets folder and your nypl_items.csv into your databases folder with the mv command:

$ mv cheat-sheet.txt cheatsheets
$ mv nypl_items.csv databases

What next?

That's the end of the command line session, but if you want to continue to learn about the command line, take a look at the other useful commands or additional challenges.

<<< Previous

Go to other commands >>>
Go to further challenges >>>

Example

Searching a text file with grep