Document Databases Schemaless Store data in documents and these documents are grouped into collection If you are familiar with relational databases...
Relational Databases vs NoSQL Relational DatabasesNoSQL Use tables/rows/columnsDon't Use tables/rows/columns Need a predefind schema /...
MapReduce · MapReduce MapReduce is a programming model for data processing which written in various languages. MapReduce programs are inherently...
Meet Hadoop · Meet Hadoop In pioneer days they used oxen for heavy pulling, and when one ox couldn’t budge a log, they didn’t try to grow a larger ox....
Data Structures Structured Semi structured Unstructured Structured Data Easy to search and organized Consistent model, rows and columns Defined...
%lprun command Calculate runtime for each line in the code 1.Load line profiler in your IPython session %load_ext line_profiler 2.Use %lprun...