MapReduce is a programming model suited to processing very large datasets. Hadoop can run MapReduce programs written in various languages: Java, Ruby, Python, and C++. MapReduce programs are parallel in nature, which makes them very useful for large-scale data analysis across multiple machines in a cluster. MapReduce programs work in two phases: 1. … Continue reading What is MapReduce? How it Works
Month: April 2018
What is Big Data? Big data is a collection of large datasets that cannot be processed using traditional computing techniques. Testing these datasets involves various tools, techniques, and frameworks. Big data relates to data creation, storage, retrieval, and analysis that is remarkable in terms of volume, variety, and velocity. Big Data Testing … Continue reading Big Data Testing: Functional & Performance
One of Big Data SQL’s key benefits is that it leverages the great performance capabilities of Oracle Database 12c. I thought it would be interesting to illustrate an example – and in this case we’ll review a performance optimization that has been around for quite a while and is used at thousands of customers: Materialized … Continue reading Using Materialized Views with Big Data SQL to Accelerate Performance
As data science has matured over the past few years, so has the need for a different approach to data and its “bigness.” There are business applications where Hadoop outperforms the newcomer Spark, but Spark has its place in the big data space because of its speed and its ease of use. This analysis examines … Continue reading Hadoop vs. Spark: The New Age of Big Data
Problem: I need to export data from the Hadoop Distributed File System (HDFS) to a SQL Server database table. How can I do this? Solution: Apache's Sqoop allows for importing data from a database such as SQL Server into HDFS, and for exporting data from HDFS to a database table. In this tip … Continue reading Export from Hadoop File System to a SQL Server Database Table
There are many advantages to processing Big Data analytics in real time: knowing about errors instantly within the organisation; implementing new strategies; improving service dramatically; detecting fraud the moment it happens; cost savings; better sales insights; keeping up with customer trends. The advantages of processing Big Data in real-time are many: Errors within the … Continue reading What are the advantages of Big Data Analytics?
Big data means a huge amount of data that is beyond the capability of traditional data management systems to manage and analyse within a specified time span. Big Data comes from many sources, among them digital media, online transaction records, and cellphone signals. Hadoop has its own advantages, and in order to overcome … Continue reading What is the advantages of Hadoop and Big data?