Searching text data

So we've cleaned our data set, but how do we find entries that use a particular term?

Let's say I want to find all the entries in our data set that use the term "Paris."

$ cat nypl_items.csv | grep -i "paris"
[...]

This will print out all the lines that contain the word "Paris." (The -i flag makes the command ignore capitalization.) Let's use our wc -l command to see how many lines that is:

$ $ cat nypl_items.csv | grep -i "paris" | wc -l
191

Challenge

Use the grep command to explore our .csv file a bit. What areas are best covered by the data set?

Before we finish...

Before you leave today, we're going to prepare a little for our upcoming sessions. In your projects folder on the desktop, we're going to create a folder to house our cheat sheets for the week, as well as a new folder for the upcoming databases workshop.

$ pwd
/Users/jojo/Desktop/projects
$ mkdir cheatsheets
$ mkdir databases

Then move your cheat-sheet.txt file into your cheatsheets folder and your nypl_items.csv into your databases folder with the mv command:

$ mv cheat-sheet.txt cheatsheets
$ mv nypl_items.csv databases

What next?

That's the end of the command line session, but if you want to continue to learn about the command line, take a look at the other useful commands or additional challenges.

<<< Previous

Go to other commands >>>
Go to further challenges >>>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

grep.md

grep.md

Searching text data

Challenge

Before we finish...

What next?

Example

Files

grep.md

Latest commit

History

grep.md

File metadata and controls

Searching text data

Challenge

Before we finish...

What next?

Example