This is a guide to installing MongoDB Compass and connecting to the Nellodee remote server. As well as a brief overview of some simple mongo queries using the Compass GUI.
Navigate to Install MongoDB Compass
Choose the version corresponding to your operating system.
You shouldn't need to install MongoDB itself to connect to a remote instance of MongoDB but I will leave the link to install MongoDB on your local system here: Install MongoDB
If you want to run/test/use MongoDB on your local system, you will need to follow the steps on the link above and make sure you choose the Community Edition corresponding to your operating system.
-
First check to see if the mongod service is already running by using the following command in the terminal
service mongod status
-
If the
Active
property says "active (running)" then you can skip to the next step. -
If the
Active
property says "inactive (dead)" then you can run the following commandsudo systemctl start mongod
At this point you should have a working version of the MongoDB Compass GUI installed and opened and confirmed that the mongo service has started on nellodee.
Watch the video to see me walk through the steps to connect to the mongo service on nellodee.
mongo_compass_nellodee.mp4
- Click
New Connection
button - Click
Advanced Connection Options
expansion button - Choose
Proxy/SSH
tab - Fill in all four fields:
- SSH Hostname:
nellodee.si.umich.edu
- SSH Port:
22
- SSH Username:
your umich uniqname
- SSH Password:
password associated with your uniqname
- SSH Hostname:
- Click
Save & Connect
- You will be prompted to choose a color and enter a name for your connection
At this point you should be connected to the running mongo service on nellodee and see a list of the databases stored in mongo on nellodee. If you are already familiar with MongoDB queries then you should be good to go.
If not, I will link a cheat sheet to standard mongo queries to get you started.
Each method below can be opened up for a more detailed explanation and examples
Filters the documents to pass only the documents that match the specified condition(s) to the next pipeline stage.
The $match
stage has the following prototype form:
{ $match: { <query> } }
$match
takes a document that specifies the query conditions. The query syntax is identical to the read operation query syntax; i.e. $match
does not accept raw aggregation expressions. Instead, use a $expr
query expression to include aggregation expression in $match
.
The examples use a collection named articles
with the following documents:
{ "_id" : ObjectId("512bc95fe835e68f199c8686"), "author" : "dave", "score" : 80, "views" : 100 }
{ "_id" : ObjectId("512bc962e835e68f199c8687"), "author" : "dave", "score" : 85, "views" : 521 }
{ "_id" : ObjectId("55f5a192d4bede9ac365b257"), "author" : "ahn", "score" : 60, "views" : 1000 }
{ "_id" : ObjectId("55f5a192d4bede9ac365b258"), "author" : "li", "score" : 55, "views" : 5000 }
{ "_id" : ObjectId("55f5a1d3d4bede9ac365b259"), "author" : "annT", "score" : 60, "views" : 50 }
{ "_id" : ObjectId("55f5a1d3d4bede9ac365b25a"), "author" : "li", "score" : 94, "views" : 999 }
{ "_id" : ObjectId("55f5a1d3d4bede9ac365b25b"), "author" : "ty", "score" : 95, "views" : 1000 }
The following operation uses $match
to perform a simple equality match:
db.articles.aggregate(
[ { $match : { author : "dave" } } ]
);
The $match
selects the documents where the author
field equals dave
, and the aggregation returns the following:
{ "_id" : ObjectId("512bc95fe835e68f199c8686"), "author" : "dave", "score" : 80, "views" : 100 }
{ "_id" : ObjectId("512bc962e835e68f199c8687"), "author" : "dave", "score" : 85, "views" : 521 }
The `$group` stage separates documents into groups according to a "group key". The output is one document for each unique group key.
A group key is often a field, or group of fields. The group key can also be the result of an expression. Use the _id
field in the $group
pipeline stage to set the group key. See below for usage examples.
In the $group
stage output, the _id
field is set to the group key for that document.
The output documents can also contain additional fields that are set using accumulator expressions.
$group
does not order its output documents.
The $group
stage has the following prototype form:
{
$group:
{
_id: <expression>, // Group key
<field1>: { <accumulator1> : <expression1> },
...
}
}
The <accumulator>
operator must be one of the following accumulator operators:
list of valid accumulators
In mongosh create a sample collection named sales with the following documents:
db.sales.insertMany([
{ "_id" : 1, "item" : "abc", "price" : NumberDecimal("10"), "quantity" : NumberInt("2"), "date" : ISODate("2014-03-01T08:00:00Z") },
{ "_id" : 2, "item" : "jkl", "price" : NumberDecimal("20"), "quantity" : NumberInt("1"), "date" : ISODate("2014-03-01T09:00:00Z") },
{ "_id" : 3, "item" : "xyz", "price" : NumberDecimal("5"), "quantity" : NumberInt( "10"), "date" : ISODate("2014-03-15T09:00:00Z") },
{ "_id" : 4, "item" : "xyz", "price" : NumberDecimal("5"), "quantity" : NumberInt("20") , "date" : ISODate("2014-04-04T11:21:39.736Z") },
{ "_id" : 5, "item" : "abc", "price" : NumberDecimal("10"), "quantity" : NumberInt("10") , "date" : ISODate("2014-04-04T21:23:13.331Z") },
{ "_id" : 6, "item" : "def", "price" : NumberDecimal("7.5"), "quantity": NumberInt("5" ) , "date" : ISODate("2015-06-04T05:08:13Z") },
{ "_id" : 7, "item" : "def", "price" : NumberDecimal("7.5"), "quantity": NumberInt("10") , "date" : ISODate("2015-09-10T08:43:00Z") },
{ "_id" : 8, "item" : "abc", "price" : NumberDecimal("10"), "quantity" : NumberInt("5" ) , "date" : ISODate("2016-02-06T20:20:13Z") },
])
The following aggregation operation uses the $group
stage to count the number of documents in the sales
collection:
db.sales.aggregate( [ { $group: { _id: null, count: { $count: { } } } }] )
The operation returns the following result:
{ "_id" : null, "count" : 8 }
This aggregation operation is equivalent to the following SQL statement:
SELECT COUNT(*) AS count FROM sales
Passes along the documents with the requested fields to the next stage in the pipeline. The specified fields can be existing fields from the input documents or newly computed fields.
The $project
stage has the following prototype form:
{ $project: { <specification(s)> } }
The $project
takes a document that can specify the inclusion of fields, the suppression of the _id
field, the addition of new fields, and the resetting of the values of existing fields. Alternatively, you may specify the exclusion of fields.
The $project
specifications have the following forms:
Form | Description |
---|---|
<field>: <1 or true> |
Specifies the inclusion of a field. Non-zero integers are also treated as true . |
_id: <0 or false> |
Specifies the suppression of the _id field.To exclude a field conditionally, use the REMOVE variable instead. For details, see Exclude Fields Conditionally. |
<field>: <expression> |
Adds a new field or resets the value of an existing field. If the expression evaluates to $$REMOVE , the field is excluded in the output. For details, see Exclude Fields Conditionally. |
<field>:<0 or false> |
Specifies the exclusion of a field. To exclude a field conditionally, use the REMOVE variable instead. For details, see Exclude Fields Conditionally.If you specify the exclusion of a field other than _id , you cannot employ any other $project specification forms. This restriction does not apply to conditionally exclusion of a field using the REMOVE variable.See also the $unset stage to exclude fields. |
-
The
_id
field is, by default, included in the output documents. To include any other fields from the input documents in the output documents, you must explicitly specify the inclusion in$project
. -
If you specify an inclusion of a field that does not exist in the document,
$project
ignores that field inclusion and does not add the field to the document.
By default, the _id
field is included in the output documents. To exclude the _id
field from the output documents, you must explicitly specify the suppression of the _id
field in $project
.
If you specify the exclusion of a field or fields, all other fields are returned in the output documents.
{ $project: { "<field1>": 0, "<field2>": 0, ... } } // Return all but the specified fields
If you specify the exclusion of a field other than _id
, you cannot employ any other $project
specification forms: i.e. if you exclude fields, you cannot also specify the inclusion of fields, reset the value of existing fields, or add new fields. This restriction does not apply to conditional exclusion of a field using the REMOVE
variable.
See also the $unset
stage to exclude fields.
You can use the variable REMOVE
in aggregation expressions to conditionally suppress a field. For an example, see Conditionally Exclude Fields.
Consider a books
collection with the following document:
{ "_id" : 1, title: "abc123", isbn: "0001122223334", author: { last: "zzz", first: "aaa" }, copies: 5}
The following $project
stage includes only the _id
, title
, and the author
fields in its output documents:
db.books.aggregate( [ { $project : { title : 1 , author : 1 } } ] )
The operation results in the following document:
{ "_id" : 1, "title" : "abc123", "author" : { "last" : "zzz", "first" : "aaa" } }
The _id
field is always included by default. To exclude the _id
field from the output documents of the $project
stage, specify the exclusion of the _id
field by setting it to 0
in the projection document.
Consider a books
collection with the following document:
{ "_id" : 1, title: "abc123", isbn: "0001122223334", author: { last: "zzz", first: "aaa" }, copies: 5}
The following $project
stage excludes the _id
field but includes the title
, and the author
fields in its output documents:
db.books.aggregate( [ { $project : { _id: 0, title : 1 , author : 1 } } ] )
The operation results in the following document:
{ "title" : "abc123", "author" : { "last" : "zzz", "first" : "aaa" } }
Consider a books
collection with the following document:
{ "_id" : 1, title: "abc123", isbn: "0001122223334", author: { last: "zzz", first: "aaa" }, copies: 5, lastModified: "2016-07-28"}
The following $project
stage excludes the lastModified
field from the output:
db.books.aggregate( [ { $project : { "lastModified": 0 } } ] )
See also the $unset
stage to exclude fields.
Consider a books
collection with the following document:
{ "_id" : 1, title: "abc123", isbn: "0001122223334", author: { last: "zzz", first: "aaa" }, copies: 5, lastModified: "2016-07-28"}
The following $project
stage excludes the author.first
and lastModified
fields from the output:
db.books.aggregate( [ { $project : { "author.first" : 0, "lastModified" : 0 } } ] )
Alternatively, you can nest the exclusion specification in a document:
db.bookmarks.aggregate( [ { $project: { "author": { "first": 0}, "lastModified" : 0 } } ] )
Both specifications result in the same output:
{ "_id" : 1, "title" : "abc123", "isbn" : "0001122223334", "author" : { "last" : "zzz" }, "copies" : 5,}
See also the $unset
stage to exclude fields.
You can use the variable REMOVE
in aggregation expressions to conditionally suppress a field.
Consider a books
collection with the following document:
{ "_id" : 1, title: "abc123", isbn: "0001122223334", author: { last: "zzz", first: "aaa" }, copies: 5, lastModified: "2016-07-28"}{ "_id" : 2, title: "Baked Goods", isbn: "9999999999999", author: { last: "xyz", first: "abc", middle: "" }, copies: 2, lastModified: "2017-07-21"}{ "_id" : 3, title: "Ice Cream Cakes", isbn: "8888888888888", author: { last: "xyz", first: "abc", middle: "mmm" }, copies: 5, lastModified: "2017-07-22"}
The following $project
stage uses the REMOVE
variable to excludes the author.middle
field only if it equals ""
:
db.books.aggregate( [ { $project: { title: 1, "author.first": 1, "author.last" : 1, "author.middle": { $cond: { if: { $eq: [ "", "$author.middle" ] }, then: "$$REMOVE", else: "$author.middle" } } } }] )
The aggregation operation results in the following output:
{ "_id" : 1, "title" : "abc123", "author" : { "last" : "zzz", "first" : "aaa" } }
{ "_id" : 2, "title" : "Baked Goods", "author" : { "last" : "xyz", "first" : "abc" } }{ "_id" : 3, "title" : "Ice Cream Cakes", "author" : { "last" : "xyz", "first" : "abc", "middle" : "mmm" } }
Adds new fields to documents. `$set` outputs documents that contain all existing fields from the input documents and newly added fields.
New in version 4.2.
The $set
stage is an alias for $addFields
.
Both stages are equivalent to a $project
stage that explicitly specifies all existing fields in the input documents and adds the new fields.
$set
has the following form:
{ $set: { <newField>: <expression>, ... } }
Specify the name of each field to add and set its value to an aggregation expression. For more information on expressions, see Expressions.
If the name of the new field is the same as an existing field name (including _id
), $set
overwrites the existing value of that field with the value of the specified expression.
$set
appends new fields to existing documents. You can include one or more $set
stages in an aggregation operation.
To add field or fields to embedded documents (including documents in arrays) use the dot notation. See example.
To add an element to an existing array field with $set
, use with $concatArrays
. See example.
Create a sample scores
collection with the following:
db.scores.insertMany([ { _id: 1, student: "Maya", homework: [ 10, 5, 10 ], quiz: [ 10, 8 ], extraCredit: 0 }, { _id: 2, student: "Ryan", homework: [ 5, 6, 5 ], quiz: [ 8, 8 ], extraCredit: 8 }])
The following operation uses two $set
stages to include three new fields in the output documents:
db.scores.aggregate( [ { $set: { totalHomework: { $sum: "$homework" }, totalQuiz: { $sum: "$quiz" } } }, { $set: { totalScore: { $add: [ "$totalHomework", "$totalQuiz", "$extraCredit" ] } } }] )
The operation returns the following documents:
{ "_id" : 1, "student" : "Maya", "homework" : [ 10, 5, 10 ], "quiz" : [ 10, 8 ], "extraCredit" : 0, "totalHomework" : 25, "totalQuiz" : 18, "totalScore" : 43}{ "_id" : 2, "student" : "Ryan", "homework" : [ 5, 6, 5 ], "quiz" : [ 8, 8 ], "extraCredit" : 8, "totalHomework" : 16, "totalQuiz" : 16, "totalScore" : 40}
Use dot notation to add new fields to embedded documents.
Create a sample collection vehicles
with the following:
db.vehicles.insertMany([ { _id: 1, type: "car", specs: { doors: 4, wheels: 4 } }, { _id: 2, type: "motorcycle", specs: { doors: 0, wheels: 2 } }, { _id: 3, type: "jet ski" }])
The following aggregation operation adds a new field fuel_type
to the embedded document specs
.
db.vehicles.aggregate( [ { $set: { "specs.fuel_type": "unleaded" } }] )
The operation returns the following results:
{ _id: 1, type: "car", specs: { doors: 4, wheels: 4, fuel_type: "unleaded" } }{ _id: 2, type: "motorcycle", specs: { doors: 0, wheels: 2, fuel_type: "unleaded" } }{ _id: 3, type: "jet ski", specs: { fuel_type: "unleaded" } }
Specifying an existing field name in a $set
operation causes the original field to be replaced.
Create a sample collection called animals
with the following:
db.animals.insertOne( { _id: 1, dogs: 10, cats: 15 } )
The following $set
operation overrides the cats
field:
db.animals.aggregate( [ { $set: { "cats": 20 } }] )
The operation returns the following document:
{ _id: 1, dogs: 10, cats: 20 }
It is possible to replace one field with another. In the following example the item
field substitutes for the _id
field.
Create a sample collection called fruits
contains the following documents:
db.fruits.insertMany([ { "_id" : 1, "item" : "tangerine", "type" : "citrus" }, { "_id" : 2, "item" : "lemon", "type" : "citrus" }, { "_id" : 3, "item" : "grapefruit", "type" : "citrus" }])
The following aggregration operation uses $set
to replace the _id
field of each document with the value of the item
field, and replaces the item
field with a string "fruit"
.
db.fruits.aggregate( [ { $set: { _id : "$item", item: "fruit" } }] )
The operation returns the following:
{ "_id" : "tangerine", "item" : "fruit", "type" : "citrus" }{ "_id" : "lemon", "item" : "fruit", "type" : "citrus" }{ "_id" : "grapefruit", "item" : "fruit", "type" : "citrus" }
Create a sample scores
collection with the following:
db.scores.insertMany([ { _id: 1, student: "Maya", homework: [ 10, 5, 10 ], quiz: [ 10, 8 ], extraCredit: 0 }, { _id: 2, student: "Ryan", homework: [ 5, 6, 5 ], quiz: [ 8, 8 ], extraCredit: 8 }])
You can use $set
with a $concatArrays
expression to add an element to an existing array field. For example, the following operation uses $set
to replace the homework
field with a new array whose elements are the current homework
array concatenated with another array containing a new score [ 7 ]
.
db.scores.aggregate([ { $match: { _id: 1 } }, { $set: { homework: { $concatArrays: [ "$homework", [ 7 ] ] } } }])
The operation returns the following:
{ "_id" : 1, "student" : "Maya", "homework" : [ 10, 5, 10, 7 ], "quiz" : [ 10, 8 ], "extraCredit" : 0 }
Create a sample scores
collection with the following:
db.scores.insertMany([ { _id: 1, student: "Maya", homework: [ 10, 5, 10 ], quiz: [ 10, 8 ], extraCredit: 0 }, { _id: 2, student: "Ryan", homework: [ 5, 6, 5 ], quiz: [ 8, 8 ], extraCredit: 8 }])
The following aggregation operation adds a new field quizAverage
to each document that contains the average of the quiz
array.
db.scores.aggregate( [ { $set: { quizAverage: { $avg: "$quiz" } } }] )
The operation returns the following documents:
[ { _id: 1, student: 'Maya', homework: [ 10, 5, 10 ], quiz: [ 10, 8 ], extraCredit: 0, quizAverage: 9 }, { _id: 2, student: 'Ryan', homework: [ 5, 6, 5 ], quiz: [ 8, 8 ], extraCredit: 8, quizAverage: 8 }]
Sorts all input documents and returns them to the pipeline in sorted order.
The $sort
stage has the following prototype form:
{ $sort: { <field1>: <sort order>, <field2>: <sort order> ... } }
$sort
takes a document that specifies the field(s) to sort by and the respective sort order. <sort order>
can have one of the following values:
Value | Description |
---|---|
1 |
Sort ascending. |
-1 |
Sort descending. |
{ $meta: "textScore" } |
Sort by the computed textScore metadata in descending order. See Text Score Metadata Sort for an example. |
If sorting on multiple fields, sort order is evaluated from left to right. For example, in the form above, documents are first sorted by <field1>
. Then documents with the same <field1>
values are further sorted by <field2>
.
You can sort on a maximum of 32 keys.
MongoDB does not store documents in a collection in a particular order. When sorting on a field which contains duplicate values, documents containing those values may be returned in any order.
If consistent sort order is desired, include at least one field in your sort that contains unique values. The easiest way to guarantee this is to include the _id
field in your sort query.
Consider the following restaurant
collection:
db.restaurants.insertMany( [ { "_id" : 1, "name" : "Central Park Cafe", "borough" : "Manhattan"}, { "_id" : 2, "name" : "Rock A Feller Bar and Grill", "borough" : "Queens"}, { "_id" : 3, "name" : "Empire State Pub", "borough" : "Brooklyn"}, { "_id" : 4, "name" : "Stan's Pizzaria", "borough" : "Manhattan"}, { "_id" : 5, "name" : "Jane's Deli", "borough" : "Brooklyn"},] )
The following command uses the $sort
stage to sort on the borough
field:
db.restaurants.aggregate( [ { $sort : { borough : 1 } } ])
In this example, sort order may be inconsistent, since the borough
field contains duplicate values for both Manhattan
and Brooklyn
. Documents are returned in alphabetical order by borough
, but the order of those documents with duplicate values for borough
might not the be the same across multiple executions of the same sort. For example, here are the results from two different executions of the above command:
{ "_id" : 3, "name" : "Empire State Pub", "borough" : "Brooklyn" }{ "_id" : 5, "name" : "Jane's Deli", "borough" : "Brooklyn" }{ "_id" : 1, "name" : "Central Park Cafe", "borough" : "Manhattan" }{ "_id" : 4, "name" : "Stan's Pizzaria", "borough" : "Manhattan" }{ "_id" : 2, "name" : "Rock A Feller Bar and Grill", "borough" : "Queens" }{ "_id" : 5, "name" : "Jane's Deli", "borough" : "Brooklyn" }{ "_id" : 3, "name" : "Empire State Pub", "borough" : "Brooklyn" }{ "_id" : 4, "name" : "Stan's Pizzaria", "borough" : "Manhattan" }{ "_id" : 1, "name" : "Central Park Cafe", "borough" : "Manhattan" }{ "_id" : 2, "name" : "Rock A Feller Bar and Grill", "borough" : "Queens" }
While the values for borough
are still sorted in alphabetical order, the order of the documents containing duplicate values for borough
(i.e. Manhattan
and Brooklyn
) is not the same.
To achieve a consistent sort, add a field which contains exclusively unique values to the sort. The following command uses the $sort
stage to sort on both the borough
field and the _id
field:
db.restaurants.aggregate( [ { $sort : { borough : 1, _id: 1 } } ])
Since the _id
field is always guaranteed to contain exclusively unique values, the returned sort order will always be the same across multiple executions of the same sort.
For the field or fields to sort by, set the sort order to 1
or -1
to specify an ascending or descending sort respectively, as in the following example:
db.users.aggregate( [ { $sort : { age : -1, posts: 1 } } ])
This operation sorts the documents in the users
collection, in descending order according by the age
field and then in ascending order according to the value in the posts
field.
When comparing values of different BSON types in sort operations, MongoDB uses the following comparison order, from lowest to highest:
-
MinKey (internal type)
-
Null
-
Numbers (ints, longs, doubles, decimals)
-
Symbol, String
-
Object
-
Array
-
BinData
-
ObjectId
-
Boolean
-
Date
-
Timestamp
-
Regular Expression
-
MaxKey (internal type)
For details on the comparison/sort order for specific types, see Comparison/Sort Order.
For a pipeline that includes a $text
search, you can sort by descending relevance score using the { $meta: "textScore" }
expression. In the { <sort-key> }
document, set the { $meta: "textScore" }
expression to an arbitrary field name. The field name is ignored by the query system. For example:
db.users.aggregate( [ { $match: { $text: { $search: "operating" } } }, { $sort: { score: { $meta: "textScore" }, posts: -1 } } ])
This operation uses the $text
operator to match the documents, and then sorts first by the "textScore"
metadata in descending order, and then by the posts
field in descending order. The score
field name in the sort document is ignored by the query system. In this pipeline, the "textScore"
metadata is not included in the projection and is not returned as part of the matching documents. See $meta
for more information.
Deconstructs an array field from the input documents to output a document for _each_
element. Each output document is the input document with the value of the array field replaced by the element.
_each_
element. Each output document is the input document with the value of the array field replaced by the element.You can pass a field path operand or a document operand to unwind an array field.
You can pass the array field path to $unwind
. When using this syntax, $unwind
does not output a document if the field value is null, missing, or an empty array.
{ $unwind: <field path> }
When you specify the field path, prefix the field name with a dollar sign $
and enclose in quotes.
You can pass a document to $unwind
to specify various behavior options.
{ $unwind: { path: <field path>, includeArrayIndex: <string>, preserveNullAndEmptyArrays: <boolean> }}
Field | Type | Description |
---|---|---|
path | string | Field path to an array field. To specify a field path, prefix the field name with a dollar sign $ and enclose in quotes. |
includeArrayIndex | string | Optional. The name of a new field to hold the array index of the element. The name cannot start with a dollar sign $ . |
preserveNullAndEmptyArrays | boolean | Optional. - If true , if the path is null, missing, or an empty array, $unwind outputs the document.- If false , if path is null, missing, or an empty array, $unwind does not output a document.The default value is false . |
-
When the operand does not resolve to an array, but is not missing,
null
, or an empty array,$unwind
treats the operand as a single element array. -
When the operand is
null
, missing, or an empty array$unwind
follows the behavior set for the preserveNullAndEmptyArrays option.
If you specify a path for a field that does not exist in an input document or the field is an empty array, $unwind
, by default, ignores the input document and will not output documents for that input document.
To output documents where the array field is missing, null or an empty array, use the preserveNullAndEmptyArrays option.
In mongosh
, create a sample collection named inventory
with the following document:
db.inventory.insertOne({ "_id" : 1, "item" : "ABC1", sizes: [ "S", "M", "L"] })
The following aggregation uses the $unwind
stage to output a document for each element in the sizes
array:
db.inventory.aggregate( [ { $unwind : "$sizes" } ] )
The operation returns the following results:
{ "_id" : 1, "item" : "ABC1", "sizes" : "S" }{ "_id" : 1, "item" : "ABC1", "sizes" : "M" }{ "_id" : 1, "item" : "ABC1", "sizes" : "L" }
Each document is identical to the input document except for the value of the sizes
field which now holds a value from the original sizes
array.
Consider the clothing
collection:
db.clothing.insertMany([ { "_id" : 1, "item" : "Shirt", "sizes": [ "S", "M", "L"] }, { "_id" : 2, "item" : "Shorts", "sizes" : [ ] }, { "_id" : 3, "item" : "Hat", "sizes": "M" }, { "_id" : 4, "item" : "Gloves" }, { "_id" : 5, "item" : "Scarf", "sizes" : null }])
$unwind
treats the sizes
field as a single element array if:
-
the field is present,
-
the value is not null, and
-
the value is not an empty array.
Expand the sizes
arrays with $unwind
:
db.clothing.aggregate( [ { $unwind: { path: "$sizes" } } ] )
The $unwind
operation returns:
{ _id: 1, item: 'Shirt', sizes: 'S' },{ _id: 1, item: 'Shirt', sizes: 'M' },{ _id: 1, item: 'Shirt', sizes: 'L' },{ _id: 3, item: 'Hat', sizes: 'M' }
-
In document
"_id": 1
,sizes
is a populated array.$unwind
returns a document for each element in thesizes
field. -
In document
"_id": 3
,sizes
resolves to a single element array. -
Documents
"_id": 2, "_id": 4
, and"_id": 5
do not return anything because thesizes
field cannot be reduced to a single element array.
The { path: <FIELD> }
syntax is optional. The following $unwind
operations are equivalent.
db.clothing.aggregate( [ { $unwind: "$sizes" } ] )db.clothing.aggregate( [ { $unwind: { path: "$sizes" } } ] )
The preserveNullAndEmptyArrays
and includeArrayIndex
examples use the following collection:
db.inventory2.insertMany([ { "_id" : 1, "item" : "ABC", price: NumberDecimal("80"), "sizes": [ "S", "M", "L"] }, { "_id" : 2, "item" : "EFG", price: NumberDecimal("120"), "sizes" : [ ] }, { "_id" : 3, "item" : "IJK", price: NumberDecimal("160"), "sizes": "M" }, { "_id" : 4, "item" : "LMN" , price: NumberDecimal("10") }, { "_id" : 5, "item" : "XYZ", price: NumberDecimal("5.75"), "sizes" : null }])
The following $unwind
operation uses the preserveNullAndEmptyArrays option to include documents whose sizes
field is null, missing, or an empty array.
db.inventory2.aggregate( [ { $unwind: { path: "$sizes", preserveNullAndEmptyArrays: true } }] )
The output includes those documents where the sizes
field is null, missing, or an empty array:
{ "_id" : 1, "item" : "ABC", "price" : NumberDecimal("80"), "sizes" : "S" }{ "_id" : 1, "item" : "ABC", "price" : NumberDecimal("80"), "sizes" : "M" }{ "_id" : 1, "item" : "ABC", "price" : NumberDecimal("80"), "sizes" : "L" }{ "_id" : 2, "item" : "EFG", "price" : NumberDecimal("120") }{ "_id" : 3, "item" : "IJK", "price" : NumberDecimal("160"), "sizes" : "M" }{ "_id" : 4, "item" : "LMN", "price" : NumberDecimal("10") }{ "_id" : 5, "item" : "XYZ", "price" : NumberDecimal("5.75"), "sizes" : null }
The following $unwind
operation uses the includeArrayIndex option to include the array index in the output.
db.inventory2.aggregate( [ { $unwind: { path: "$sizes", includeArrayIndex: "arrayIndex" } }])
The operation unwinds the sizes
array and includes the array index in the new arrayIndex
field. If the sizes
field does not resolve to a populated array but is not missing, null, or an empty array, the arrayIndex
field is null
.
{ "_id" : 1, "item" : "ABC", "price" : NumberDecimal("80"), "sizes" : "S", "arrayIndex" : NumberLong(0) }{ "_id" : 1, "item" : "ABC", "price" : NumberDecimal("80"), "sizes" : "M", "arrayIndex" : NumberLong(1) }{ "_id" : 1, "item" : "ABC", "price" : NumberDecimal("80"), "sizes" : "L", "arrayIndex" : NumberLong(2) }{ "_id" : 3, "item" : "IJK", "price" : NumberDecimal("160"), "sizes" : "M", "arrayIndex" : null }
In mongosh
, create a sample collection named inventory2
with the following documents:
db.inventory2.insertMany([ { "_id" : 1, "item" : "ABC", price: NumberDecimal("80"), "sizes": [ "S", "M", "L"] }, { "_id" : 2, "item" : "EFG", price: NumberDecimal("120"), "sizes" : [ ] }, { "_id" : 3, "item" : "IJK", price: NumberDecimal("160"), "sizes": "M" }, { "_id" : 4, "item" : "LMN" , price: NumberDecimal("10") }, { "_id" : 5, "item" : "XYZ", price: NumberDecimal("5.75"), "sizes" : null }])
The following pipeline unwinds the sizes
array and groups the resulting documents by the unwound size values:
db.inventory2.aggregate( [ // First Stage { $unwind: { path: "$sizes", preserveNullAndEmptyArrays: true } }, // Second Stage { $group: { _id: "$sizes", averagePrice: { $avg: "$price" } } }, // Third Stage { $sort: { "averagePrice": -1 } }] )
First Stage:
The $unwind
stage outputs a new document for each element in the sizes
array. The stage uses the preserveNullAndEmptyArrays option to include in the output those documents where sizes
field is missing, null or an empty array. This stage passes the following documents to the next stage:
{ "_id" : 1, "item" : "ABC", "price" : NumberDecimal("80"), "sizes" : "S" }{ "_id" : 1, "item" : "ABC", "price" : NumberDecimal("80"), "sizes" : "M" }{ "_id" : 1, "item" : "ABC", "price" : NumberDecimal("80"), "sizes" : "L" }{ "_id" : 2, "item" : "EFG", "price" : NumberDecimal("120") }{ "_id" : 3, "item" : "IJK", "price" : NumberDecimal("160"), "sizes" : "M" }{ "_id" : 4, "item" : "LMN", "price" : NumberDecimal("10") }{ "_id" : 5, "item" : "XYZ", "price" : NumberDecimal("5.75"), "sizes" : null }
Second Stage:
The $group
stage groups the documents by sizes
and calculates the average price of each size. This stage passes the following documents to the next stage:
{ "_id" : "S", "averagePrice" : NumberDecimal("80") }{ "_id" : "L", "averagePrice" : NumberDecimal("80") }{ "_id" : "M", "averagePrice" : NumberDecimal("120") }{ "_id" : null, "averagePrice" : NumberDecimal("45.25") }
Third Stage:
The $sort
stage sorts the documents by averagePrice
in descending order. The operation returns the following result:
{ "_id" : "M", "averagePrice" : NumberDecimal("120") }{ "_id" : "L", "averagePrice" : NumberDecimal("80") }{ "_id" : "S", "averagePrice" : NumberDecimal("80") }{ "_id" : null, "averagePrice" : NumberDecimal("45.25") }
In mongosh
, create a sample collection named sales
with the following documents:
db.sales.insertMany([ { _id: "1", "items" : [ { "name" : "pens", "tags" : ["writing", "office", "school", "stationary" ], "price" : NumberDecimal("12.00"), "quantity" : NumberInt("5") }, { "name" : "envelopes", "tags" : [ "stationary", "office" ], "price" : NumberDecimal("19.95"), "quantity" : NumberInt("8") } ] }, { _id: "2", "items" : [ { "name" : "laptop", "tags" : [ "office", "electronics" ], "price" : NumberDecimal("800.00"), "quantity" : NumberInt("1") }, { "name" : "notepad", "tags" : [ "stationary", "school" ], "price" : NumberDecimal("14.95"), "quantity" : NumberInt("3") } ] }])
The following operation groups the items sold by their tags and calculates the total sales amount per each tag.
db.sales.aggregate([ // First Stage { $unwind: "$items" }, // Second Stage { $unwind: "$items.tags" }, // Third Stage { $group: { _id: "$items.tags", totalSalesAmount: { $sum: { $multiply: [ "$items.price", "$items.quantity" ] } } } }])
First Stage
The first $unwind
stage outputs a new document for each element in the items
array:
{ "_id" : "1", "items" : { "name" : "pens", "tags" : [ "writing", "office", "school", "stationary" ], "price" : NumberDecimal("12.00"), "quantity" : 5 } }{ "_id" : "1", "items" : { "name" : "envelopes", "tags" : [ "stationary", "office" ], "price" : NumberDecimal("19.95"), "quantity" : 8 } }{ "_id" : "2", "items" : { "name" : "laptop", "tags" : [ "office", "electronics" ], "price" : NumberDecimal("800.00"), "quantity" : 1 } }{ "_id" : "2", "items" : { "name" : "notepad", "tags" : [ "stationary", "school" ], "price" : NumberDecimal("14.95"), "quantity" : 3 } }
Second Stage
The second $unwind
stage outputs a new document for each element in the items.tags
arrays:
{ "_id" : "1", "items" : { "name" : "pens", "tags" : "writing", "price" : NumberDecimal("12.00"), "quantity" : 5 } }{ "_id" : "1", "items" : { "name" : "pens", "tags" : "office", "price" : NumberDecimal("12.00"), "quantity" : 5 } }{ "_id" : "1", "items" : { "name" : "pens", "tags" : "school", "price" : NumberDecimal("12.00"), "quantity" : 5 } }{ "_id" : "1", "items" : { "name" : "pens", "tags" : "stationary", "price" : NumberDecimal("12.00"), "quantity" : 5 } }{ "_id" : "1", "items" : { "name" : "envelopes", "tags" : "stationary", "price" : NumberDecimal("19.95"), "quantity" : 8 } }{ "_id" : "1", "items" : { "name" : "envelopes", "tags" : "office", "price" : NumberDecimal("19.95"), "quantity" : 8 } }{ "_id" : "2", "items" : { "name" : "laptop", "tags" : "office", "price" : NumberDecimal("800.00"), "quantity" : 1 } }{ "_id" : "2", "items" : { "name" : "laptop", "tags" : "electronics", "price" : NumberDecimal("800.00"), "quantity" : 1 } }{ "_id" : "2", "items" : { "name" : "notepad", "tags" : "stationary", "price" : NumberDecimal("14.95"), "quantity" : 3 } }{ "_id" : "2", "items" : { "name" : "notepad", "tags" : "school", "price" : NumberDecimal("14.95"), "quantity" : 3 } }
Third Stage
The $group
stage groups the documents by the tag and calculates the total sales amount of items with each tag:
{ "_id" : "writing", "totalSalesAmount" : NumberDecimal("60.00") }{ "_id" : "stationary", "totalSalesAmount" : NumberDecimal("264.45") }{ "_id" : "electronics", "totalSalesAmount" : NumberDecimal("800.00") }{ "_id" : "school", "totalSalesAmount" : NumberDecimal("104.85") }{ "_id" : "office", "totalSalesAmount" : NumberDecimal("1019.60") }
Takes the documents returned by the aggregation pipeline and writes them to a specified collection. Starting in MongoDB 4.4, you can specify the output database.
The $out
stage must be the last stage in the pipeline. The $out
operator lets the aggregation framework return result sets of any size.
$out
replaces the specified collection if it exists. See Replace Existing Collection for details.
The $out
stage has the following syntax:
-
Starting in MongoDB 4.4,
$out
can take a document to specify the output database as well as the output collection:{ $out: { db: "<output-db>", coll: "<output-collection>" } }
Field Description db The output database name.
- For a replica set or a standalone, if the output database does not exist,$out
also creates the database.
- For a sharded cluster, the specified output database must already exist.coll The output collection name. -
$out
can take a string to specify only the output collection (i.e. output to a collection in the same database):{ $out: "<output-collection>" } // Output collection is in the same database