Discuss Zookeeper ”; Previous Next ZooKeeper is a distributed co-ordination service to manage large set of hosts. Co-ordinating and managing a service in a distributed environment is a complicated process. ZooKeeper solves this issue with its simple architecture and API. ZooKeeper allows developers to focus on core application logic without worrying about the distributed nature of the application. The ZooKeeper framework was originally built at “Yahoo!” for accessing their applications in an easy and robust manner. Later, Apache ZooKeeper became a standard for organized service used by Hadoop, HBase, and other distributed frameworks. For example, Apache HBase uses ZooKeeper to track the status of distributed data. This tutorial explains the basics of ZooKeeper, how to install and deploy a ZooKeeper cluster in a distributed environment, and finally concludes with a few examples using Java programming and sample applications. Print Page Previous Next Advertisements ”;
Category: Big Data & Analytics
Sqoop Questions and Answers ”; Previous Next Sqoop Questions and Answers has been designed with a special intention of helping students and professionals preparing for various Certification Exams and Job Interviews. This section provides a useful collection of sample Interview Questions and Multiple Choice Questions (MCQs) and their answers with appropriate explanations. SN Question/Answers Type 1 Sqoop Interview Questions This section provides a huge collection of Sqoop Interview Questions with their answers hidden in a box to challenge you to have a go at them before discovering the correct answer. 2 Sqoop Online Quiz This section provides a great collection of Sqoop Multiple Choice Questions (MCQs) on a single page along with their correct answers and explanation. If you select the right option, it turns green; else red. 3 Sqoop Online Test If you are preparing to appear for a Java and Sqoop related certification exam, then this section is a must for you. This section simulates a real online test along with a given timer which challenges you to complete the test within a given time-frame. Finally you can check your overall test score and how you fared among millions of other candidates who attended this online test. 4 Sqoop Mock Test This section provides various mock tests that you can download at your local machine and solve offline. Every mock test is supplied with a mock test key to let you verify the final score and grade yourself. Print Page Previous Next Advertisements ”;
Tableau – Bump Chart
Tableau – Bump Chart ”; Previous Next A Bump Chart is used to compare two dimensions against each other using one of the Measure value. They are very useful for exploring the changes in Rank of a value over a time dimension or place dimension or some other dimension relevant to the analysis. The Bump Chart takes two dimensions with zero or more measures. Creating a Bump Chart Using the Sample-superstore, plan to find the variation of ship mode of products with the variation of the Sub-Category. To achieve this objective, following are the steps. Step 1 − Drag and drop the dimension Sub-Category to the Columns shelf. Also drag the dimension Ship mode to the Color shelf under Marks card. Leave the chart type to Automatic. The following chart appears. Step 2 − Next, create a calculated field called Rank. Go to Analysis → Create Calculated Field. Use Rank as the field name and write the expression index () in the calculation area. It is an inbuilt function that creates an index for the current row in the partition. Click OK and the new field will be visible in the measures section. Right-click on the field Rank and convert it to discrete. Step 3 − Drag Rank to the Rows shelf. The following chart appears which shows the dimension Sub-Category with each ship mode arranged in an increasing order of their Rank value. Step 4 − Apply some more calculation to the rank field using the measure Profit. Rightclick on Rank and choose Edit Table calculation. Choose the sorting by the field profit using partition by Sub-Category and addressed by ship mode. The following screenshot shows the calculations applied. On completion of the above steps, you will get the bump chart as shown in the following screenshot. It shows the variation of profit for each ship mode across various subcategories. Print Page Previous Next Advertisements ”;
Sqoop – Useful Resources
Sqoop – Useful Resources ”; Previous Next The following resources contain additional information on Sqoop. Please use them to get more in-depth knowledge on this topic. Useful Video Courses Big Data Analytics Using Hive In Hadoop 21 Lectures 2 hours Mukund Kumar Mishra More Detail Advance Big Data Analytics using Hive & Sqoop Best Seller 51 Lectures 4 hours Navdeep Kaur More Detail Big Data Hadoop Course Best Seller 90 Lectures 11.5 hours TELCOMA Global More Detail Learn Big Data Hadoop: Hands-On for Beginner 256 Lectures 13.5 hours Bigdata Engineer More Detail Big Data Crash Course 68 Lectures 9 hours DataCouch More Detail Big Data For Architects 61 Lectures 7.5 hours DataCouch More Detail Print Page Previous Next Advertisements ”;
Tableau – Functions
Tableau – Functions ”; Previous Next Any data analysis involves a lot of calculations. In Tableau, the calculation editor is used to apply calculations to the fields being analyzed. Tableau has a number of inbuilt functions which help in creating expressions for complex calculations. Following are the description of different categories of functions. Number Functions String Functions Date Functions Logical Functions Aggregate Functions Number Functions These are the functions used for numeric calculations. They only take numbers as inputs. Following are some examples of important number functions. Function Description Example CEILING (number) Rounds a number to the nearest integer of equal or greater value. CEILING(2.145) = 3 POWER (number, power) Raises the number to the specified power. POWER(5,3) = 125 ROUND (number, [decimals]) Rounds the numbers to a specified number of digits. ROUND(3.14152,2) = 3.14 String Functions String Functions are used for string manipulation. Following are some important string functions with examples Function Description Example LEN (string) Returns the length of the string. LEN(“Tableau”) = 7 LTRIM (string) Returns the string with any leading spaces removed. LTRIM(” Tableau “) = “Tableau” REPLACE (string, substring, replacement) Searches the string for substring and replaces it with a replacement. If the substring is not found, the string is not changed. REPLACE(“GreenBlueGreen”, “Blue”, “Red”) = “GreenRedGreen” UPPER (string) Returns string, with all characters uppercase. UPPER(“Tableau”) = “TABLEAU” Date Functions Tableau has a variety of date functions to carry out calculations involving dates. All the date functions use the date_part which is a string indicating the part of the date such as – month, day, or year. Following table lists some examples of important date functions. Function Description Example DATEADD (date_part, increment, date) Returns an increment added to the date. The type of increment is specified in date_part. DATEADD (”month”, 3, #2004-04-15#) = 2004-0715 12:00:00 AM DATENAME (date_part, date, [start_of_week]) Returns date_part of date as a string. The start_of_week parameter is optional. DATENAME(”month”, #200404-15#) = “April” DAY (date) Returns the day of the given date as an integer. DAY(#2004-04-12#) = 12 NOW( ) Returns the current date and time. NOW( ) = 2004-04-15 1:08:21 PM Logical Functions These functions evaluate some single value or the result of an expression and produce a boolean output. Function Description Example IFNULL (expression1, expression2) The IFNULL function returns the first expression if the result is not null, and returns the second expression if it is null. IFNULL([Sales], 0) = [Sales] ISDATE (string) The ISDATE function returns TRUE if the string argument can be converted to a date, and FALSE if it cannot. ISDATE(“11/05/98”) = TRUE ISDATE(“14/05/98”) = FALSE MIN(expression) The MIN function returns the minimum of an expression across all records or the minimum of two expressions for each record. Aggregate Functions Function Description Example AVG(expression) Returns the average of all the values in the expression. AVG can be used with numeric fields only. Null values are ignored. COUNT (expression) Returns the number of items in a group. Null values are not counted. MEDIAN (expression) Returns the median of an expression across all records. Median can only be used with numeric fields. Null values are ignored. STDEV (expression) Returns the statistical standard deviation of all values in the given expression based on a sample of the population. Print Page Previous Next Advertisements ”;
Sqoop – Home
Sqoop Tutorial PDF Version Quick Guide Resources Job Search Discussion Sqoop is a tool designed to transfer data between Hadoop and relational database servers. It is used to import data from relational databases such as MySQL, Oracle to Hadoop HDFS, and export from Hadoop file system to relational databases. This is a brief tutorial that explains how to make use of Sqoop in Hadoop ecosystem. Audience This tutorial is prepared for professionals aspiring to make a career in Big Data Analytics using Hadoop Framework with Sqoop. ETL developers and professionals who are into analytics in general may as well use this tutorial to good effect. Prerequisites Before proceeding with this tutorial, you need a basic knowledge of Core Java, Database concepts of SQL, Hadoop File system, and any of Linux operating system flavors. Print Page Previous Next Advertisements ”;
Tableau – Numeric Calculations ”; Previous Next Numeric calculations in Tableau are done using a wide range of inbuilt functions available in the formula editor. In this chapter, we will see how to apply calculations to the fields. The calculations can be as simple as subtracting the values of two fields or applying an aggregate function to a single field. Following are the steps to create a calculation field and use numeric functions in it. Create Calculated Field While connected to Sample-superstore, go to the Analysis menu and click ‘Create Calculated Field’, as shown in the following screenshot. Calculation Editor The above step opens a calculation editor which lists all the functions that is available in Tableau. You can change the dropdown value and see only the functions related to numbers. Create a Formula To study the difference between profit and discount for different shipping mode of the products, create a formula subtracting the discount from the profit as shown in the following screenshot. Also, name this field as profit_n_discount. Using the Calculated Field The above calculated field can be used in the view by dragging it to the Rows shelf as shown in the following screenshot. It produces a bar chart showing the difference between profit and discount for different shipping modes. Applying Aggregate Calculations In a similar manner as above, you can create a calculated field using aggregate function. Here, create AVG(sales) values for different ship mode. Write the formula in the calculation editor as shown in the following screenshot. On clicking OK and dragging the Avg_Sales field to the Rows shelf, you will get the following view. Print Page Previous Next Advertisements ”;
Tableau – Add Worksheets
Tableau – Add Worksheets ”; Previous Next Worksheet in the Tableau screen is the area where you create the views for data analysis. By default, Tableau provides three blank worksheets when you have established a connection to data source. You can go on adding multiple worksheets to look at different data views in the same screen, one after another. Adding a Worksheet You can add a worksheet in two ways. Right-click on the name of the current worksheet and choose the option New Worksheet from the pop-up menu. You can also click on the small icon to the right of the last sheet name to add a worksheet. Quick Preview of a Worksheet Staying in one worksheet, you can have a quick preview of another worksheet by hovering the mouse on the name of the other worksheet. Print Page Previous Next Advertisements ”;
Root Mean Square
Statistics – Root Mean Square ”; Previous Next Root Mean Square, RMS is defined as the square root of mean square where mean square is the arithmetic mean of the squares of numbers. RMS is also termed as the quadratic mean. Formula ${ x_{rms} = sqrt{ frac{1}{n} ( {x_1}^2 + {x_2}^2 + … + {x_n}^2 } }$ Where − ${x_i}$ = items under observation. ${n}$ = total number of items. Example Problem Statement: Compute the RMS of following data. 5 6 7 8 9 Solution: Step 1: Compute squares of each no. ${ {x_1}^2 + {x_2}^2 + … + {x_n}^2 \[7pt] = 6^2 + 7^2 + 8^2 + 9^2 \[7pt] = 36 + 49 + 64 + 81 \[7pt] = 230 }$ Step 2: Compute mean of squares of each no. ${ frac{1}{n} ({x_1}^2 + {x_2}^2 + … + {x_n}^2 ) \[7pt] = frac{1}{4} (230) \[7pt] = frac{230}{4} \[7pt] = 57.5 }$ Step 3: Compute RMS by taking sqrt of means of squares. ${ x_{rms} = sqrt{ frac{1}{n} ( {x_1}^2 + {x_2}^2 + … + {x_n}^2 } \[7pt] = sqrt {57.5} \[7pt] = frac{230}{4} \[7pt] = 7.58 }$ As a result, RMS is ${7.58}$. Print Page Previous Next Advertisements ”;
Stem and Leaf Plot
Statistics – Stem and Leaf Plot ”; Previous Next Stemplots are similar to histogram with the difference that in histogram, bars are used to compare data and in case of stemplots leaves represents actual numbers to be compared. Stemplots are also called stem and leaves plot as there is one step with largest place value digits on the left and at leaf(ves) to the right. A Stemplot is used to draw quantitative data with fewer than 50 observations. In a stemplot, left side entries are called stems; and the right side entries are called leaves. In figure above, the stems are tens (here 5 represents 50, 6 represents 60, and so on); and the leaves are actual values. Stems and leaves may be labelled as – millions, thousands, ones, tenths, etc. Example Problem Statement: Draw Stemplot diagram for the following data points. 64 82 85 99 96 81 97 80 81 80 84 87 98 75 86 88 82 78 81 86 80 50 84 88 83 82 Solution: Step 1 – Sort the numbers in ascending order. 50 64 75 78 80 80 80 81 81 81 82 82 82 83 84 84 85 86 86 87 88 88 96 97 98 99 Step 2 – Choose step as largest place value. In our case it is 10. So each step will represent 10 units. Step 3 – Group the numbers based on stem value. 50 64 75 78 80 80 80 81 81 81 82 82 82 83 84 84 85 86 86 87 88 88 96 97 98 99 Step 4 – Draw the stem numbers as 10”s place-value digits 5, 6, 7, 8 and 9 (each number is representing 10 units). Draw the leaves as 1”s place value. Print Page Previous Next Advertisements ”;