Mongodb - The Nosql Database
Mongodb - The Nosql Database
Mongodb - The Nosql Database
Akhil Latta
Software Engineer
INTRODUCTION
Development of MongoDB began at 10gen in 2007, when the company was building a
platform as a service similar to Windows Azure or Google App Engine. In 2009, MongoDB
was open sourced as a stand-alone product with an AGPL license.
International Journal of Enterprise Computing and Business Systems
ISSN (Online) : 2230-8849
Volume 2 Issue 2 July 2013
International Manuscript ID : ISSN22308849-V2I2M2-072013
In March 2010, from version 1.4, MongoDB has been considered production ready. The
latest stable version, 2.4.5, was released in July 3 2013.
MongoDB is available for free under the GNU Affero General Public License. The language
drivers are available under an Apache License. In addition, 10gen offers commercial
licenses for MongoDB.
MAIN FEATURES
Ad hoc queries
MongoDB supports search by field, range queries, regular expression searches.
Queries can return specific fields of documents and also include user-defined
JavaScript functions.
Indexing
Any field in a MongoDB document can be indexed (indices in MongoDB are
conceptually similar to those in RDBMSes). Secondary indices are also available.
Replication
MongoDB supports master-slave replication. A master can perform reads and writes.
A slave copies data from the master and can only be used for reads or backup (not
writes). The slaves have the ability to select a new master if the current one goes
down.
Load balancing
MongoDB scales horizontally using sharding. The developer chooses a shard key,
which determines how the data in a collection will be distributed. The data is split into
ranges (based on the shard key) and distributed across multiple shards. (A shard is a
master with one or more slaves.)
International Journal of Enterprise Computing and Business Systems
ISSN (Online) : 2230-8849
Volume 2 Issue 2 July 2013
International Manuscript ID : ISSN22308849-V2I2M2-072013
MongoDB can run over multiple servers, balancing the load and/or duplicating data to
keep the system up and running in case of hardware failure. Automatic configuration
is easy to deploy and new machines can be added to a running database.
File storage
MongoDB could be used as a file system, taking advantage of load balancing and
data replication features over multiple machines for storing files.
This function, called GridFS, is included with MongoDB drivers and available with no
difficulty for development languages. MongoDB exposes functions for file
manipulation and content to developers. GridFS is used, for example, in plugins for
NGINX. and lighttpd
In a multi-machine MongoDB system, files can be distributed and copied multiple
times between machines transparently, thus effectively creating a load balanced and
fault tolerant system.
Aggregation
MapReduce can be used for batch processing of data and aggregation operations.
The aggregation framework enables users to obtain the kind of results for which the
SQL GROUP BY clause is used.
Server-side JavaScript execution
JavaScript can be used in queries, aggregation functions (such as MapReduce), are
sent directly to the database to be executed.
Capped collections
MongoDB supports fixed-size collections called capped collections. This type of
collection maintains insertion order and, once the specified size has been reached,
behaves like a circular queue.
LANGUAGE SUPPORT
MongoDB has official drivers for a variety of popular programming languages and
development environments. Web programming language Opa also has built-in support for
MongoDB, which is tightly integrated in the language and offers a type-safety layer on top of
International Journal of Enterprise Computing and Business Systems
ISSN (Online) : 2230-8849
Volume 2 Issue 2 July 2013
International Manuscript ID : ISSN22308849-V2I2M2-072013
MongoDB. There are also a large number of unofficial or community-supported drivers for
other programming languages and frameworks.
HTTP/REST INTERFACES
There are REST and HTTP interfaces that allow the manipulation of MongoDB entries via
HTTP GET, POST, UPDATE, and DELETE calls.
MONGODB TOOLS
mongo
MongoDB offers an interactive shell called mongo, which lets developers view,
insert, remove, and update data in their databases, as well as get replication
information, set up sharding, shut down servers, execute JavaScript, and more.
Administrative information can also be accessed through a web interface, a simple
webpage that serves information about the current server status. By default, this
interface is 1000 ports above the database port (28017).
mongostat
mongostat is a command-line tool that displays a summary list of status statistics for
a currently running MongoDB instance: how many inserts, updates, removes,
queries, and commands were performed, as well as what percentage of the time the
database was locked and how much memory it is using. This tool is similar to the
UNIX/Linux vmstat utility.
mongotop
mongotop is a command-line tool providing a method to track the amount of time a
MongoDB instance spends reading and writing data. mongotop provides statistics on
International Journal of Enterprise Computing and Business Systems
ISSN (Online) : 2230-8849
Volume 2 Issue 2 July 2013
International Manuscript ID : ISSN22308849-V2I2M2-072013
the per-collection level. By default, mongotop returns values every second. This tool
is similar to the UNIX/Linux top utility.
mongosniff
mongosniff is a command-line tool providing a low-level tracing/sniffing view into
database activity by monitoring (or "sniffing") network traffic going to and from
MongoDB. mongosniff requires the Libpcap network library and is only available for
Unix-like systems. A cross-platform alternative is the open source Wireshark packet
analyzer which has full support for the MongoDB wire protocol.
mongoimport, mongoexport
mongoimport is a command-line utility to import content from a JSON, CSV, or TSV
export created by mongoexport or potentially other third-party data exports. Usage
information can be found in the MongoDB Manual's section on Importing and
Exporting MongoDB Data.
mongodump, mongorestore
mongodump is a command-line utility for creating a binary export of the contents of a
Mongo database; mongorestore can be used to reload a database dump. Data
backup strategies and considerations are detailed in the MongoDB Manual's section
on Backup and Restoration Strategies.
MONITORING PLUGINS
There are MongoDB monitoring plugins available for the following network tools:
More monitoring and diagnostic tools for MongoDB are listed on MongoDB Admin Zone:
Monitoring and Diagnostics
International Journal of Enterprise Computing and Business Systems
ISSN (Online) : 2230-8849
Volume 2 Issue 2 July 2013
International Manuscript ID : ISSN22308849-V2I2M2-072013
Several GUIs have been created by MongoDB's developer community to help visualize their
data. Some popular ones are:
More client tools for MongoDB are listed on MongoDB Administrator Manual
International Journal of Enterprise Computing and Business Systems
ISSN (Online) : 2230-8849
Volume 2 Issue 2 July 2013
International Manuscript ID : ISSN22308849-V2I2M2-072013
• Jaspersoft BI: Java based Report Designer and Report Server that supports
MongoDB
• Pentaho: MongoDB connectors for Pentaho Kettle and Pentaho BI
• RJMetrics: A hosted Business Intelligence Solution that supports MongoDB.
• eCommerce Analytics: eCommerce Analytics Software that supports MongoDB data
analysis.
• Nucleon BI Studio: MS Windows based business intelligence software that supports
MongoDB and other RDBMS.
REFERENCES
1. DB-Engines Ranking
2. MongoDB daddy: My baby beats Google BigTable
a b
3. The MongoDB NoSQL Database Blog, The AGPL
4. The MongoDB NoSQL Database Blog, MongoDB 1.4 Ready for Production
5. MongoDB Support by 10gen
6. Article "Sharding" on MongoDB Administrator's Manual
7. GridFS article on MongoDB Developer's Manual
8. NGINX plugin for MongoDB source code
9. lighttpd plugin for MongoDB source code.
10. "MongoDB Drivers and Client Libraries — MongoDB Ecosystem 2.2.2".
Mongodb.org. Retrieved 2013-07-08.