Mongodb Interview Questions (V4.4)
Mongodb Interview Questions (V4.4)
Mongodb Interview Questions (V4.4)
4 ) | mongodb-interview-questions
View on GitHub
Table of Contents
MongoDB Commands
MongoDB Coding Practice
Q. What are NoSQL databases? What are the different types of NoSQL databases?
NoSQL is a non-relational DBMS, that does not require a fixed schema, avoids joins, and is easy to scale. The purpose of using a
NoSQL database is for distributed data stores with humongous data storage needs. NoSQL is used for Big data and real-time web
apps.
Document databases
Key-value stores
Column-oriented databases
Graph databases
1. Document databases
A document database stores data in JSON, BSON , or XML documents. In a document database, documents can be nested.
Particular elements can be indexed for faster querying.
Documents can be stored and retrieved in a form that is much closer to the data objects used in applications, which means less
translation is required to use the data in an application. SQL data must often be assembled and disassembled when moving back
and forth between applications and storage.
Example: Amazon SimpleDB, CouchDB, MongoDB, Riak, Lotus Notes are popular Document originated DBMS systems.
2. Key-value Stores
https://learning-zone.github.io/mongodb-interview-questions/ 1/25
6/16/22, 9:22 AM MongoDB Interview Questions ( v4.4 ) | mongodb-interview-questions
Data is stored in key/value pairs. It is designed in such a way to handle lots of data and heavy load. Key-value pair storage
databases store data as a hash table where each key is unique, and the value can be a JSON, BLOB(Binary Large Objects), string, etc.
Example: of key-value stores are Redis, Voldemort, Riak, and Amazon's DynamoDB.
3. Column-Oriented Databases
Column-oriented databases work on columns and are based on BigTable paper by Google. Every column is treated separately. The
values of single column databases are stored contiguously.
They deliver high performance on aggregation queries like SUM, COUNT, AVG, MIN, etc. as the data is readily available in a column.
Example: Column-based NoSQL databases are widely used to manage data warehouses, business intelligence, CRM, Library card
catalogs, HBase, Cassandra, HBase, Hypertable are examples of a column-based database.
4. Graph Databases
A graph type database stores entities as well the relations amongst those entities. The entity is stored as a node with the
relationship as edges. An edge gives a relationship between nodes. Every node and edge has a unique identifier.
Compared to a relational database where tables are loosely connected, a Graph database is a multi-relational in nature. Traversing
relationships as fast as they are already captured into the DB, and there is no need to calculate them.
Graph base databases mostly used for social networks, logistics, spatial data.
Example: Neo4J, Infinite Graph, OrientDB, FlockDB are some popular graph-based databases.
↥ back to top
Q. What is MongoDB?
MongoDB is a document-oriented NoSQL database used for high volume data storage. Instead of using tables and rows as in the
traditional relational databases, MongoDB makes use of collections and documents. Documents consist of key-value pairs which
are the basic unit of data in MongoDB. Collections contain sets of documents and function which is the equivalent of relational
database tables.
https://learning-zone.github.io/mongodb-interview-questions/ 2/25
6/16/22, 9:22 AM MongoDB Interview Questions ( v4.4 ) | mongodb-interview-questions
Key Features
Key Components
1. _id: The _id field represents a unique value in the MongoDB document. The _id field is like the document's primary key. If you
create a new document without an _id field, MongoDB will automatically create the field.
2. Collection: This is a grouping of MongoDB documents. A collection is the equivalent of a table which is created in any other
RDMS such as Oracle.
3. Cursor: This is a pointer to the result set of a query. Clients can iterate through a cursor to retrieve results.
4. Database: This is a container for collections like in RDMS wherein it is a container for tables. Each database gets its own set of
files on the file system. A MongoDB server can store multiple databases.
5. Document: A record in a MongoDB collection is basically called a document. The document, in turn, will consist of field name and
values.
6. Field: A name-value pair in a document. A document has zero or more fields. Fields are analogous to columns in relational
databases.
Example:
[Read More]
↥ back to top
Indexes are special data structures that store a small portion of the collection's data set in an easy to traverse form. The index
stores the value of a specific field or set of fields, ordered by the value of the field. The ordering of the index entries supports
efficient equality matches and range-based query operations. In addition, MongoDB can return sorted results by using the ordering
in the index.
Example
The createIndex() method only creates an index if an index of the same specification does not already exist. The following
example ( using Node.js ) creates a single key descending index on the name field:
https://learning-zone.github.io/mongodb-interview-questions/ 3/25
6/16/22, 9:22 AM MongoDB Interview Questions ( v4.4 ) | mongodb-interview-questions
console.log(result);
callback(result);
↥ back to top
MongoDB supports user-defined indexes like single field index. A single field index is used to create an index on the single field of a
document. With single field index, MongoDB can traverse in ascending and descending order. By default, each collection has a
single field index automatically created on the _id field, the primary key.
Example
"_id": 1,
"age": 29,
With this kind of index we can improve all the queries that find documents with a condition and the age field, like the following:
db.people.find( { age : 20 } )
2. Compound Index
A compound index is an index on multiple fields. Using the same people collection we can create a compound index combining the
city and age field.
In this case, we have created a compound index where the first entry is the value of the city field, the second is the value of the age
field, and the third is the person.name. All the fields here are defined in ascending order.
3. Multikey Index
This is the index type for arrays. When creating an index on an array, MongoDB will create an index entry for every element.
Example
"_id": 1,
"age": 34,
https://learning-zone.github.io/mongodb-interview-questions/ 4/25
6/16/22, 9:22 AM MongoDB Interview Questions ( v4.4 ) | mongodb-interview-questions
db.people.createIndex( { hobbies: 1} )
4. Geospatial Index
GeoIndexes are a special index type that allows a search based on location, distance from a point and many other different
features. To query geospatial data, MongoDB supports two types of indexes – 2d indexes and 2d sphere indexes . 2d indexes use
planar geometry when returning results and 2dsphere indexes use spherical geometry to return results.
5. Text Index
It is another type of index that is supported by MongoDB. Text index supports searching for string content in a collection. These
index types do not store language-specific stop words (e.g. “the”, “a”, “or”). Text indexes restrict the words in a collection to only
store root words.
Example
entries.insert( {
site: "home",
entries.insert( {
site: "work",
entries.insert( {
site: "home",
title: 10,
text: 5
},
name: "TextIndex",
default_language: "english",
6. Hashed Index
MongoDB supports hash-based sharding and provides hashed indexes. These indexes are the hashes of the field value. Shards use
hashed indexes and create a hash according to the field value to spread the writes across the sharded instances.
↥ back to top
https://learning-zone.github.io/mongodb-interview-questions/ 5/25
6/16/22, 9:22 AM MongoDB Interview Questions ( v4.4 ) | mongodb-interview-questions
TTL ( Time To Live ) is a special option that we can apply only to a single field index to permit the automatic deletion of documents
after a certain time.
During index creation, we can define an expiration time. After that time, all the documents that are older than the expiration time
will be removed from the collection. This kind of feature is very useful when we are dealing with data that don't need to persist in
the database ( eg. session data ).
Example
In this case, MongoDB will drop the documents from the collection automatically once half an hour (1800 seconds) has passed since
the value in lastUpdateTime field.
Restrictions
2. Partial indexes
A partial index is an index that contains only a subset of the values based on a filter rule. They are useful in cases where:
Example
db.people.createIndex(
{ "city": 1, "person.surname": 1 },
We have created a compound index on city and person.surname but only for the documents with age less than 30.
In order for the
partial index to be used the queries must contain a condition on the age field.
3. Sparse indexes
Sparse indexes are a subset of partial indexes. A sparse index only contains elements for the documents that have the indexed
field, even if it is null.
Since MongoDB is a schemaless database, the documents in a collection can have different fields, so an indexed field may not be
present in some of them.
Example
In this case, we are assuming there could be documents in the collection with the field city missing. Sparse indexes are based on
the existence of a field in the documents and are useful to reduce the size of the index.
4. Unique indexes
MongoDB can create an index as unique. An index defined this way cannot contain duplicate entries.
Example
https://learning-zone.github.io/mongodb-interview-questions/ 6/25
6/16/22, 9:22 AM MongoDB Interview Questions ( v4.4 ) | mongodb-interview-questions
↥ back to top
Q. How many indexes does MongoDB create by default for a new collection?
By default MongoDB creates a unique index on the _id field during the creation of a collection. The _id index prevents clients
from inserting two documents with the same value for the _id field.
↥ back to top
Syntax
For example, consider an inventory collection that contains the following documents:
db.inventory.createIndex( { ratings: 1 } )
The following query looks for documents where the ratings field is the array [ 5, 9 ]:
db.inventory.find( { ratings: [ 5, 9 ] } )
MongoDB can use the multikey index to find documents that have 5 at any position in the ratings array. Then, MongoDB retrieves
these documents and filters for documents whose ratings array equals the query array [ 5, 9 ].
↥ back to top
The database profiler collects detailed information about Database Commands executed against a running mongod instance. This
includes CRUD operations as well as configuration and administration commands.
https://learning-zone.github.io/mongodb-interview-questions/ 7/25
6/16/22, 9:22 AM MongoDB Interview Questions ( v4.4 ) | mongodb-interview-questions
db.getProfilingLevel()
// Output
db.getProfilingStatus()
// Output
db.setProfilingLevel(1, 40)
// Output
↥ back to top
The $unset operator deletes a particular field. If the field does not exist, then $unset does nothing. When used with $ to match an
array element, $unset replaces the matching element with null rather than removing the matching element from the array. This
behavior keeps consistent the array size and element positions.
syntax:
Example:
db.collection.update(
{},
$unset : {
"properties.service" : 1
},
multi: true
);
db.collection.find(
"properties.service" : {
$exists : true
).count(true);
↥ back to top
https://learning-zone.github.io/mongodb-interview-questions/ 8/25
6/16/22, 9:22 AM MongoDB Interview Questions ( v4.4 ) | mongodb-interview-questions
MongoDB stores BSON (Binary Interchange and Structure Object Notation) objects in the collection. The concatenation of the
collection name and database name is called a namespace
↥ back to top
With MongoDB, replication is achieved through a Replica Set. Writer operations are sent to the primary server (node), which
applies the operations across secondary servers, replicating the data. If the primary server fails (through a crash or system failure),
one of the secondary servers takes over and becomes the new primary node via election. If that server comes back online, it
becomes a secondary once it fully recovers, aiding the new primary node.
↥ back to top
The primary node receives all write operations. A replica set can have only one primary capable of confirming writes with { w:
"majority" } write concern; although in some circumstances, another mongod instance may transiently believe itself to also be
primary.
The secondaries replicate the primary's oplog and apply the operations to their data sets such that the secondaries' data sets
reflect the primary's data set. If the primary is unavailable, an eligible secondary will hold an election to elect itself the new primary.
↥ back to top
Replica Set
One is for data redundancy for failure recovery. When the hardware fails, or the node is down for other reasons, you can use
a replica for recovery.
The other purpose is for read-write splitting. It routes the reading requests to the replica to reduce the reading pressure on
the primary node.
MongoDB automatically maintains replica sets, multiple copies of data that are distributed across servers, racks and data centers.
Replica sets help prevent database downtime using native replication and automatic failover.
A replica set consists of multiple replica set members. At any given time, one member acts as the primary member, and the other
members act as secondary members. If the primary member fails for any reason (e.g., hardware failure), one of the secondary
members is automatically elected to primary and begins to process all reads and writes.
https://learning-zone.github.io/mongodb-interview-questions/ 9/25
6/16/22, 9:22 AM MongoDB Interview Questions ( v4.4 ) | mongodb-interview-questions
↥ back to top
Example:
In the normalized data model, the address documents contain a reference to the patron document.
// patron document
_id: "joe",
// address documents
city: "Faketon",
state: "MA",
zip: "12345"
patron_id: "joe",
state: "MA",
zip: "12345"
Embedded documents are particularly useful when a one-to-many relationship exists between documents. In the example shown
above, we see that a single customer has multiple addresses associated with him. The nested document structure makes it easy to
retrieve complete address information about this customer with just a single query.
↥ back to top
Q. How can you achieve primary key - foreign key relationships in MongoDB?
The primary key-foreign key relationship can be achieved by embedding one document inside the another. As an example, a
department document can have its employee document(s).
↥ back to top
↥ back to top
https://learning-zone.github.io/mongodb-interview-questions/ 10/25
6/16/22, 9:22 AM MongoDB Interview Questions ( v4.4 ) | mongodb-interview-questions
JSON is formatted as name/value pairs. In JSON documents, field names and values are separated by a colon, field name and value
pairs are separated by commas, and sets of fields are encapsulated in “curly braces” ({}).
Example:
"name": "notebook",
"qty": 50,
"status": "A",
↥ back to top
Joins Yes No
↥ back to top
https://learning-zone.github.io/mongodb-interview-questions/ 11/25
6/16/22, 9:22 AM MongoDB Interview Questions ( v4.4 ) | mongodb-interview-questions
MongoDB allows multiple clients to read and write the same data. In order to ensure consistency, it uses locking and other
concurrency control measures to prevent multiple clients from modifying the same piece of data simultaneously.
MongoDB uses multi-granularity locking that allows operations to lock at the global, database or collection level, and allows for
individual storage engines to implement their own concurrency control below the collection level (e.g., at the document-level in
WiredTiger). MongoDB uses reader-writer locks that allow concurrent readers shared access to a resource, such as a database or
collection.
Example:
The following example highlights the key components of the transactions API
await client.connect();
const transactionOptions = {
readPreference: 'primary',
writeConcern: { w: 'majority' }
};
// Step 3: Use withTransaction to start a transaction, execute the callback, and commit (or abort on error)
// Note: The callback for withTransaction MUST be async and/or return a Promise.
try {
}, transactionOptions);
} finally {
await session.endSession();
await client.close();
↥ back to top
1. MongoDB
MongoDB is one of the most popular document-oriented databases under the banner of NoSQL database. It employs the format of
key-value pairs, here called document store. Document stores in MongoDB are created is stored in BSON files which are, in fact, a
little-modified version of JSON files and hence all JS are supported.
It offers greater efficiency and reliability which in turn can meet your storage capacity and speed demands. The schema-free
implementation of MongoDB eliminates the prerequisites of defining a fixed structure. These models allow hierarchical
relationships representation and facilitate the ability to change the structure of the record.
Pros
MongoDB has a lower latency per query & spends less CPU time per query because it is doing a lot less work (e.g. no joins,
transactions). As a result, it can handle a higher load in terms of queries per second.
MongoDB is easier to shard (use in a cluster) because it doesn't have to worry about transactions and consistency.
MongoDB has a faster write speed because it does not have to worry about transactions or rollbacks (and thus does not have
to worry about locking).
It supports many Features like automatic repair, easier data distribution, and simpler data models make administration and
tuning requirements lesser in NoSQL.
NoSQL databases are cheap and open source.
NoSQL database support caching in system memory so it increases data output performance.
Cons
Storing large volumes of data without structure: A NoSQL database doesn't limit storable data types. Plus, you can add
new types as business needs change.
Using cloud computing and storage: Cloud-based storage is a great solution, but it requires data to be easily spread across
multiple servers for scaling. Using affordable hardware on-site for testing and then for production in the cloud is what NoSQL
databases are designed for.
Rapid development: If you are developing using modern agile methodologies, a relational database will slow you down. A
NoSQL database doesn't require the level of preparation typically needed for relational databases.
2. MySQL
MySQL is a popular open-source relational database management system (RDBMS) that is developed, distributed and supported by
Oracle Corporation. MySQL stores data in tables and uses structured query language (SQL) for database access. It uses Structured
Query Language SQL to access and transfer the data and commands such as ‘SELECT’, ‘UPDATE’, ‘INSERT’ and ‘DELETE’ to manage it.
Related information is stored in different tables but the concept of JOIN operations simplifies the process of correlating it and
performing queries across multiple tables and minimize the chances of data duplication. It follows the ACID (Atomic, Consistent,
Isolated and Durable) model. This means that once a transaction is complete, the data remains consistent and stable on the disc
which may include distinct multiple memory locations.
Pros
Cons
Users have to scale relational database on powerful servers that are expensive and difficult to handle. To scale relational
database, it has to be distributed on to multiple servers. Handling tables across different servers is difficult.
In SQL server's data has to fit into tables anyhow. If your data doesn't fit into tables, then you need to design your database
structure that will be complex and again difficult to handle.
↥ back to top
MongoDB, has always supported ACID transactions in a single document and, when leveraging the document model appropriately,
many applications don't need ACID guarantees across multiple documents.
MongoDB is a document based NoSQL database with a flexible schema. Transactions are not operations that should be executed
for every write operation since they incur a greater performance cost over a single document writes. With a document based
structure and denormalized data model, there will be a minimized need for transactions. Since MongoDB allows document
embedding, you don't necessarily need to use a transaction to meet a write operation.
MongoDB version 4.0 provides multi-document transaction support for replica set deployments only and probably the version
4.2 will extend support for sharded deployments.
These are multi-statement operations that need to be executed sequentially without affecting each other. For example below we
can create two transactions, one to add a user and another to update a user with a field of age.
$session.startTransaction()
session.commit_transaction()
Transactions can be applied to operations against multiple documents contained in one or many collection/database. Any changes
due to document transaction do not impact performance for workloads not related or do not require them. Until the transaction is
committed, uncommitted writes are neither replicated to the secondary nodes nor are they readable outside the transactions.
↥ back to top
For the best database performance on the transaction, developers should consider:
1. Always modify a small number of documents in a transaction. Otherwise, you will need to break the transaction into different
parts and process the documents in different batches. At most, process 1000 documents at a time.
2. Temporary exceptions such as awaiting to elect primary and transient network hiccups may result in abortion of the
transaction. Developers should establish a logic to retry the transaction if the defined errors are presented.
3. Configure optimal duration for the execution of the transaction from the default 60 seconds provided by MongoDB. Besides,
employ indexing so that it can allow fast data access within the transaction.
4. Decompose your transaction into a small set of operation so that it fits the 16MB size constraints. Otherwise, if the operation
together with oplog description exceed this limit, the transaction will be aborted.
5. All data relating to an entity should be stored in a single, rich document structure. This is to reduce the number of documents
that are to be cached when different fields are going to be changed.
↥ back to top
https://learning-zone.github.io/mongodb-interview-questions/ 14/25
6/16/22, 9:22 AM MongoDB Interview Questions ( v4.4 ) | mongodb-interview-questions
↥ back to top
But Mongo follows the exact opposite way of what we do with SQL. In MongoDB, data normalization is not requried. Indeed we
need to de-normalize and fit it into a collection of multiple documents.
ColumnA : ValueA,
ColumnB : ValueB,
Subset1 : [{
ColumnC : ValueC,
ColumnD : ValueD,
Subset2 : [{
ColumnE : ValueE,
ColumnF : ValueF
}]
}]
↥ back to top
Upsert operation is useful while importing data from external source which will update existing documents if matched otherwise it
will insert new documents into collection.
This operation first searches for the document if not present then inserts the new document into the database.
> db.car.update(
... {
... speed: 50
... },
... )
WriteResult({
"nMatched" : 0,
"nUpserted" : 1,
"nModified" : 0,
"_id" : ObjectId("548d3a955a5072e76925dc1c")
})
https://learning-zone.github.io/mongodb-interview-questions/ 15/25
6/16/22, 9:22 AM MongoDB Interview Questions ( v4.4 ) | mongodb-interview-questions
The car with the name Qualis is checked for existence and if not, a document with car name “Qualis” and speed 50 is inserted into
the database. The nUpserted with value “1” indicates a new document is inserted.
↥ back to top
↥ back to top
Q. What is oplog?
The OpLog (Operations Log) is a special capped collection that keeps a rolling record of all operations that modify the data stored in
databases.
MongoDB applies database operations on the primary and then records the operations on the primary's oplog. The secondary
members then copy and apply these operations in an asynchronous process. All replica set members contain a copy of the oplog, in
the local.oplog.rs collection, which allows them to maintain the current state of the database.
Each operation in the oplog is idempotent. That is, oplog operations produce the same results whether applied once or multiple
times to the target dataset.
PRIMARY> db.oplog.rs.find()
↥ back to top
↥ back to top
Example:
db.inventory.insertMany( [
{ item: "journal", qty: 25, size: { h: 14, w: 21, uom: "cm" }, status: "A" },
{ item: "notebook", qty: 50, size: { h: 8.5, w: 11, uom: "in" }, status: "P" },
{ item: "paper", qty: 100, size: { h: 8.5, w: 11, uom: "in" }, status: "D" },
{ item: "planner", qty: 75, size: { h: 22.85, w: 30, uom: "cm" }, status: "D" },
{ item: "postcard", qty: 45, size: { h: 10, w: 15.25, uom: "cm" }, status: "A" },
] );
// Delete Commands
db.inventory.deleteOne( { status: "D" } ) // Delete Only One Document that Matches a Condition
https://learning-zone.github.io/mongodb-interview-questions/ 16/25
6/16/22, 9:22 AM MongoDB Interview Questions ( v4.4 ) | mongodb-interview-questions
↥ back to top
Q. If you remove a document from database, does MongoDB remove it from disk?
Yes. If you remove a document from database, MongoDB will remove it from disk too.
↥ back to top
a 4-byte timestamp value, representing the ObjectId's creation, measured in seconds since the Unix epoch
a 5-byte random value
a 3-byte incrementing counter, initialized to a random value
While the BSON format itself is little-endian, the timestamp and counter values are big-endian, with the most significant bytes
appearing first in the byte sequence.
Create ObjectId
To create a new objectID manually within the MongoDB we can declare objectId() as a method.
// Output
ObjectId("5349b4ddd2781d08c09890f3")
Method Description
ObjectId.toString() Returns the JavaScript representation in the form of a string literal “ObjectId(…)”.
Since all the fields present in the query are part of an index, MongoDB matches the query conditions and returns the result using
the same index without actually looking inside the documents.
Example:
https://learning-zone.github.io/mongodb-interview-questions/ 17/25
6/16/22, 9:22 AM MongoDB Interview Questions ( v4.4 ) | mongodb-interview-questions
A collection inventory has the following index on the type and item fields:
This index will cover the following operation which queries on the type and item fields and returns only the item field:
db.inventory.find(
{ item: 1, _id: 0 }
↥ back to top
If you are running a 64-bit build of MongoDB, there is virtually no limit to storage size.
↥ back to top
↥ back to top
Database systems with large data sets or high throughput applications can challenge the capacity of a single server. For example,
high query rates can exhaust the CPU capacity of the server. Working set sizes larger than the system's RAM stress the I/O capacity
of disk drives. There are two methods for addressing system growth: vertical and horizontal scaling.
1. Vertical Scaling
Vertical Scaling involves increasing the capacity of a single server, such as using a more powerful CPU, adding more RAM, or
increasing the amount of storage space.
2. Horizontal Scaling
Horizontal Scaling involves dividing the system dataset and load over multiple servers, adding additional servers to increase
capacity as required. While the overall speed or capacity of a single machine may not be high, each machine handles a subset of the
overall workload, potentially providing better efficiency than a single high-speed high-capacity server.
MongoDB supports horizontal scaling through sharding . A MongoDB sharded cluster consists of the following components:
Shards: Each shard contains a subset of the sharded data. Each shard can be deployed as a replica set.
Mongos: The mongos acts as a query router, providing an interface between client applications and the sharded cluster.
Starting in MongoDB 4.4, mongos can support hedged reads to minimize latencies.
https://learning-zone.github.io/mongodb-interview-questions/ 18/25
6/16/22, 9:22 AM MongoDB Interview Questions ( v4.4 ) | mongodb-interview-questions
Config Servers: Config servers store metadata and configuration settings for the cluster.
↥ back to top
Aggregate function groups the records in a collection, and can be used to provide total number(sum), average, minimum, maximum
etc out of the group selected. In order to perform the aggregate function in MongoDB, aggregate () is the function to be used.
Syntax
db.collection_name.aggregate(aggregate_operation)
MongoDB's aggregation framework is modeled on the concept of data processing pipelines. Documents enter a multi-stage
pipeline that transforms the documents into an aggregated result.
Example
db.orders.aggregate([
]);
The $match stage filters the documents by the status field and passes to the next stage those documents that have status equal to
“A”. The $group stage groups the documents by the cust_id field to calculate the sum of the amount for each unique cust_id.
Expression Description
$sum Summates the defined values from all the documents in a collection
$avg Calculates the average values from all the documents in a collection
↥ back to top
Q. How can you isolate your cursors from intervening with the write operations?
As cursor is not isolated during its lifetime, thus intervening write operations on a document may result in cursor that returns a
document more than once. The snapshot() method can be used on a cursor to isolate the operation for a very specific case.
snapshot() traverses the index on the _id field and guarantees that the query will return each document no more than once.
https://learning-zone.github.io/mongodb-interview-questions/ 19/25
6/16/22, 9:22 AM MongoDB Interview Questions ( v4.4 ) | mongodb-interview-questions
Restriction:
↥ back to top
↥ back to top
Indexes do not have to fit entirely into RAM in all cases. If the value of the indexed field increments with every insert, and most
queries select recently added documents; then MongoDB only needs to keep the parts of the index that hold the most recent or
“right-most” values in RAM. This allows for efficient index use for read and write operations and minimize the amount of RAM
required to support the index.
> db.collection.totalIndexSize()
4294976499
↥ back to top
Example
db.stores.insert(
1. Text Index
MongoDB provides text indexes to support text search queries on string content. text indexes can include any field whose value
is a string or an array of string elements.
2. $text Operator
Use the $text query operator to perform text searches on a collection with a text index. $text will tokenize the search string using
whitespace and most punctuation as delimiters, and perform a logical OR of all such tokens in the search string.
Example:
https://learning-zone.github.io/mongodb-interview-questions/ 20/25
6/16/22, 9:22 AM MongoDB Interview Questions ( v4.4 ) | mongodb-interview-questions
// Returns all stores containing any terms from the list “coffee”, “shop”, and “java”
↥ back to top
Once the journal commit happens, mongod pushes data into shared view. As part of the process, it gets written to actual data
directory from the shared view (as this process happens in background). The basic advantage is, we have a reduced cycle from 60
seconds to 200 milliseconds.
In a scenario where an abruption occurs at any point of time or flash disc remains unavailable for last 59 seconds , then when the
next time mongod starts, it basically replays all write operation logs and writes into the actual data directory.
↥ back to top
Q. Is MongoDB schema-less?
As a NoSQL database, MongoDB is considered schemaless because it does not require a rigid, pre-defined schema like a relational
database. The database management system (DBMS) enforces a partial schema as data is written, explicitly listing collections and
indexes.
MongoDB is a document based database, which does not use the concept of tables and columns, instead of which it uses the
concept of documents and collections. All the referential data with respect to different modules will be stored as one collection.
More over the BSON data structure used by MongoDB can easily have varying sets of data and fields with different types.
When we say schemaless, we actually mean dynamically typed schema, as opposed to statically typed schemas as available in
RDBMS(SQL) databases. JSON is a completely schema free data structure, as opposed to XML which allows you to specify XSD if you
need.
↥ back to top
https://learning-zone.github.io/mongodb-interview-questions/ 21/25
6/16/22, 9:22 AM MongoDB Interview Questions ( v4.4 ) | mongodb-interview-questions
> db.serverStatus().storageEngine
// Output
"name" : "wiredTiger",
"supportsCommittedReads" : true,
"supportsPendingDrops" : true,
"dropPendingIdents" : NumberLong(0),
"supportsTwoPhaseIndexBuild" : true,
"supportsSnapshotReadConcern" : true,
"readOnly" : false,
"persistent" : true,
"backupCursorOpen" : false
MongoDB supports mainly 3 storage engines whose performance differ in accordance to some specific workloads. The storage
engines are:
WiredTigeris the default storage engine starting in MongoDB 3.2. It is well-suited for most workloads and is recommended for new
deployments. WiredTiger provides a document-level concurrency model, checkpointing, and compression, among other features.
The WiredTiger storage engine has both configurations of a B-Tree Based Engine and a Log Structured Merge Tree Based Engine.
In-Memory Storage Engine is available in MongoDB Enterprise. Rather than storing documents on-disk, it retains them in-memory
for more predictable data latencies.
MMAPv1 is a B-tree based system which powers many of the functions such as storage interaction and memory management to
the operating system. Its name comes from the fact that it uses memory mapped files to access data. It does so by directly loading
and modifying file contents, which are in a virtual memory through a mmap() syscall methodology.
https://learning-zone.github.io/mongodb-interview-questions/ 22/25
6/16/22, 9:22 AM MongoDB Interview Questions ( v4.4 ) | mongodb-interview-questions
↥ back to top
Rewrites and defragments all data and indexes in a collection. On WiredTiger databases, this command will release unneeded disk
space to the operating system. This command will perform a compaction “in-line”.
Syntax
↥ back to top
One to One
One to Many
Many to Many
Example: Let us consider the case of storing addresses for users. So, one user can have multiple addresses making this a 1:N
relationship.
User Collection
"_id":ObjectId("52ffc33cd85242f436000001"),
"contact": "987654321",
"dob": "01-01-1990"
Address Collection
"_id":ObjectId("52ffc4a5d85242602e000000"),
"pincode": 123456,
"state": "California"
In the embedded approach, we will embed the address document inside the user document.
> db.users.insert({
"_id":ObjectId("52ffc33cd85242f436000001"),
"contact": "987654321",
"dob": "01-01-1991",
"address": [
"pincode": 123456,
https://learning-zone.github.io/mongodb-interview-questions/ 23/25
6/16/22, 9:22 AM MongoDB Interview Questions ( v4.4 ) | mongodb-interview-questions
"state": "California"
},
"pincode": 456789,
"city": "Chicago",
"state": "Illinois"
})
This approach maintains all the related data in a single document, which makes it easy to retrieve and maintain. The whole
document can be retrieved in a single query such as −
>db.users.findOne({"name":"Alex K"},{"address":1})
The drawback is that if the embedded document keeps on growing too much in size, it can impact the read/write performance.
This is the approach of designing normalized relationship. In this approach, both the user and address documents will be
maintained separately but the user document will contain a field that will reference the address document's id field.
"_id":ObjectId("52ffc33cd85242f436000001"),
"contact": "987654321",
"dob": "01-01-1991",
"address_ids": [
ObjectId("52ffc4a5d85242602e000000"),
ObjectId("52ffc4a5d85242602e000001")
With this approach, we will need two queries: first to fetch the address_ids fields from user document and second to fetch these
addresses from address collection.
↥ back to top
Capped collections restrict updates to the documents if the update results in increased document size. Since capped collections
store documents in the order of the disk storage, it ensures that the document size does not increase the size allocated on the disk.
Capped collections are best for storing log information, cache data, or any other high volume data.
Example:-
>db.cappedLogCollection.isCapped()
https://learning-zone.github.io/mongodb-interview-questions/ 24/25
6/16/22, 9:22 AM MongoDB Interview Questions ( v4.4 ) | mongodb-interview-questions
>db.cappedLogCollection.find().sort({$natural: -1})
↥ back to top
↥ back to top
https://learning-zone.github.io/mongodb-interview-questions/ 25/25