0% found this document useful (0 votes)
271 views

Fresco

This document provides questions about data visualization concepts and tools like D3.js and R. Some key points covered include: - D3.js is used to create SVG visualizations and its main selection uses the selection.append method. JSON data can be read using d3.json and scales can be created with methods like d3.scaleLinear. - In R, packages like ggplot2 and graphics support visualization. Functions like plot, hist, and boxplot can be used to create univariate and multivariate visualizations. Parameters like xlab, ylab, main control axis labels and titles. - Ggplot2 in R requires data to be in a dataframe and uses aesthetics like size
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as XLSX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
271 views

Fresco

This document provides questions about data visualization concepts and tools like D3.js and R. Some key points covered include: - D3.js is used to create SVG visualizations and its main selection uses the selection.append method. JSON data can be read using d3.json and scales can be created with methods like d3.scaleLinear. - In R, packages like ggplot2 and graphics support visualization. Functions like plot, hist, and boxplot can be used to create univariate and multivariate visualizations. Parameters like xlab, ylab, main control axis labels and titles. - Ggplot2 in R requires data to be in a dataframe and uses aesthetics like size
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as XLSX, PDF, TXT or read online on Scribd
You are on page 1/ 50

https://play.fresco.

me/course/1

What needs to be considered first before you start a data visualization presentation?
How does visualization of big data help in interpreting information?
What is the purpose of using a dashboard for data visualization?
Data Visualization makes complex data more

Which type of visual aspect could be considered ineffective or not very accurate at conveying qua
Which law of Gestalt psychology explains individual's ability to perceive objects as shapes, letters
 A good data visualization
While at a party you notice a group of people in the crowd headed in the same direction. Your t
As per Edward Tufte, good visualizations strive towards

A bar graph is used to represent which type of data?


Which chart displays data as a collection of points, each having one variable determining the posi
When using a Bar Chart, the data values are represented by?
What differentiates histogram from a bar chart?
What type of chart can be used to place some emphasis on any trends in the data visualization p

Which of the following is powerful visualization technique for illustrating hierarchical data and p
Which map is a two-dimensional representation of data in which values are represented by color
Which of the following cannot be a geo-dimension?
Interactivity in Visualizations enables users to
A Manufacturing company wants to view the performance of various plants based on Throughput,

What is the minimum requirement to form a bar chart?


Which one of these is not a data visualization tool?
Which of these can be used to display data graphically?
What method of data representation is best suited to the demonstration of data results if that da
Representation of information in a graphic format to make the data easily understandable at a g
The tendency to treat lines as if they continue is called
Why is text added to a data visualization presentation?
How is data visualization used?
A histogram is a graphical representation of the distribution of data. It is an estimate of the _ of
Which Gestalt law explains individuals ability to mentally grouping items by color, texture, size o
In a Pie Chart, _ is used to represent data.
In which chart type quantitative data is displayed graphically and the area between axis and line
A histogram is used for which type of data?
In project, % compliance to SLAs metric is effected by factors like inflow of tickets, # of interface
Which chart displays information as a series of data points called 'markers' connected by straigh
The axis-plot/ map on which the data is to be visualized or indicated, like the axis used in plotti

TCS INTERNAL USE


The data you are using
present data in clear
Communicate
Understantable

Shapes
Law of closure
Encourasing
Law of common
describing Data thoroughly visualizing the data in a single dashboard.

Discreate
Scatter
Length of bar
Gaps btw bars
Line

Tree map
HeatMap
Mobile number
Find ans to specific qq
Parellel coordinate plot

1 masure and 1 dim


Eclips
Bar Chartplot
A scatter

Infographics
Law of cont
To add Context
Communicate information to a diverse audience.
Probability distribution
Law of Similarity

Sectors
Area Chart
Continous
Scatter plot
Line Chart
90

TCS INTERNAL USE


https://play.fresco.me/course/5

What are the prerequisites to learn D3? HTML,CSS,JS


What is the full form on D3? Data Driven Documentation
D3 is developed by Mike Bostock
What does SVG stand for? Scalable Vector Graphics
D3 is used to create ? SVG

_is or are the main selection in D3 All


What is the correct syntax to draw a circle in cx,Y,r
_is the syntax to read JSON data d3.json
What is the syntax to draw a line in D3 svg.append("line").attr("x1", 0).attr("y1", 200).attr("x2", 100).attr("y2", 100);
______ scale creates linear scale in D3 d3.scaleLinear
_is the correct syntax to generate the simplestScal.axis

var dataset = [ 5, 10, 15, 20, 25 ]; What will 5


What will be the output of scale(300)? 180
Can we group SVG Elements in D3js? 1
Which of the following attribute is being use M 10 25
Which is best way to create stacked chart in Dd3.layout.stack()
Which is not a valid scale in D3 JS? Ordinary
When is the mouseover event raised? Moved over
Which method is used for adding tooltips? div.tip()
What will be output of d3.max() function? 480
Which of the following method can be used t d3.selection.prototype.moveToFront
When trying to select a non existent dom ele Null
What does the following code do ? d3.select('Newly created
Which of these methods help alter zoom mode
Using which method of d3 can we add element d3.append()
d3.ascending (a, b)
Which of the following command is comparator
What are the attributes required to create an "cx", "cy", "rx","ry"
Can we do cross filters through D3js? 1
Which is the correct way to use XML file for Dd3.xml(url[mimeType][,callback])
Which is not a slider available in d3.js? Slider with mid value

TCS INTERNAL USE


("x2", 100).attr("y2", 100);

TCS INTERNAL USE


https://play.fresco.me/course/11

Scatterplot Matrix is an example of __ type of visualization. Multivariant


graphics,ggplot2 packages qualify for ___ Visual
Which command in R, lists the available Datasets in R? Data()
In the plot, to add label to the horizontal axis the variable to be confxlab
Histograms , Density Plots are examples of which type of visualizati Univariant
main
Title of the plot is provided by the variable

Histogram is mainly used as visual representation of _ Distribution


In case of the plot, which parameter decides whether the plot should
Type
How can we find the different parameters available ? par()
Which parameter in R helps to decide the number of bins which we Breaks
The vertical axis label is provided by the parameter ylab lty & pointch
Which parameter decides the line type & plot symbol respectively? lty & pch

To reset the grid to normal layout, after displaying multiple charts i par(mfcol=c(1,1)
Which parameter could be used to convert stacked barplot into gro beside=TRUE
If one would like to visualize 6 graphs in single view with graphs sca par(mrow(2,3))
Customizing the way your graphs could be arranged can be configurlayout()
Barchart represents visual display of of category of categorical varia Freq
Which parameter is helpful to swap the chart from x-axis to y-axis? Horiz = TRUE

The width of line can be changed by lwd=4


Which plot would be applicable for summarizing the value of numeriboxplot
col.main=4, will change size of the font ? 0
If in the Notched boxplot does not overlap, it means___ Median does not overlap
Changing the font size of the axis labels could be accomplished usin font.axis
Which parameter removed the X & Y axis? axis=F
Which feature in R allows to add new graph on top another existing points
Which argument helps in changing the size of plotting characters? cex
Stratified boxplots are useful for examining the relationship between 1

While adding text to graph, adj=0 refers to ? Text will start


Which command allows to place the text within the graph? Text
Which text feature allows the text to written along the margin of th mtext

Lattice package supports the generation of trellis graphs (graphs tha 1


Which of this is limitation of lattice plots? ALL
scatterplot in lattice is represented by _ scatter
In lattice, ~x|A refers to display numeric variable x for each level of factor A.
In lattice,y~x | A*B refers to Display the relationship between numeric variables y and

TCS INTERNAL USE


ggplot2 requires the data to be in Dataframe
In multifaceted plot, how many rows will be formed when facets is Always 1 row
Aesthetics in ggplot2 refers to size,shape,color
In multifaceted plot, how many columns will be formed when facetsAllways 1 col
Using ggplot2 for multifaceted diagram, the deciding variable on th * ~(tilde)
Factors represent the subset of the data and they should be properly 1
Geometric (geom) in ggplot2 refers to size,line,color
The grey zone around the geom("smooth") line refers to 95% confidence zone
What does gg in ggplot2 refer to Grammar of Graphics

ggplot2 allows to add features layer by layer 1


In ggplot2, which function decides the object printing ____? geom_ppoint()
The functions which assist in labelling of the functions is _. ALL

TCS INTERNAL USE


ty & pointch

each level of factor A.


een numeric variables y and x separately for every combination of factor A and B levels

TCS INTERNAL USE


https://play.fresco.me/course/161

The Angles are measured in degrees to construct arcs and pie charts in D3. 0
What is the command to invoke pie layout in D3 ? d3.layout.pie()
While drawing a pie chart from a given data set, the start angle and end angle autom 1
How will you invoke the arc path generator in d3js ? d3.svg.arc()
How many radians is 360 degrees equal to ? 6.28

I am using d3.geo.mercator() code . What is the significance of using mercator in thisMap Projection
__ translated coordinates to pixels . Projection Function
How will you invoke the link method while constructing a tree layout using D3 ? Choo
tree.links(node)
Geographic data stored in Json format is known as _ geojson
Which method of the tree layout reads the data and returns all the objects in form ofNode
A tree layout has hierarchical structure. 1

Which type of charts show the change in a given network structure over a period of Alluvial Charts
_________ project library is an initiative to build re-useable charts for d3 js . NVD3
___ is an interactive Java Script toolkit which is used for creating time series charts. Rickshaw
The goal of NVD3 is to keep the traditional cookie cutter approach to build componeFASLE

TCS INTERNAL USE


https://play.fresco.me/course/197

Independent variables refers to those variables__ Which acts as an input in the experiment.
__ is an example of Multivariate analysis in which relati partial
Pattern such as group or trend in the data table can not bIncorrect
Multivariate data analysis helps us to__ Both
Use of only one variable to describe the data is known as Uni
What are the features of multivariate random variable? both
Dependent variables refers to those variables__ Whose varable analysed
Multivariate data analysis is application of__ ALL
What is multivariate statistics? ALL

Lurking variable remains__ Hidden during the analysis


Amalgamation paradox is also known as Simpson's paradox
What is the drawback of using Kernel density estimation Plot is not smooth
Least number of coordinates required to showcase a poinDimension

What is done when a new data in the sub Interval is add


Height of the bin is increased
Principal component analysis reduces__ Large number of corelated variables
If the area under the PDF curve is zero, then__ Probability = 0
Probability mass function is also known as Probability density function
Stochastic variables are also known as Random variables

What is Random walk? We can always predict the outcome in


advance.
In box kernel density estimation The histogram is centered over the data points.

Conditional probabilty of the event


What is posterior probability? after the evidence is taken into the
consideration!
Features of probability density function are__ All
p(x|θ) is also known as the probability
What is Kernel density estimation? It is the implementation of non -
parametric density estimation.
Characteristics of Markov process are__ Both
What is prior probability? Probability distribution done in lack of
evidence.
What is density estimation? It estimates probability density
function.
If time space or state space is discrete,__ Markov process can be termed as
discrete-time Markov chains
What is kernel? all
We use __ in histogram for sub intervals. bins
What is estimation? both

TCS INTERNAL USE


What is data analysis? all
What is parameter? both
Which estimation can be represented by a single value? all

What is box kernel density estimate? Block in the histogram is centered over
the data points

TCS INTERNAL USE


e experiment.

Bin is inserted from the bottom

ver the data points.

TCS INTERNAL USE


https://play.fresco.me/course/2

a data set around the mean


information on analysis of the data
Which of the following is an arithmetic mid-value?
fifteen be lined up for a photograph
10 members and you need 3
20 males and 8 females

FA, FB) and 60%


10% of patients have

Central Limit Theorem is that


Central limit theorem and central tendency are same things

dentify the variables that are continuous or discrete

normal distribution, but does not


characteristics of normal distribution
Standard Deviation in a Normal Distribution is higher, which of the following is true

correspond to a Binomial Distribution


binomial distribution can be used

modelled using Poisson distribution


Which of the following conditions are satisfied by Poisson random variable?

Any statement whose validity is tested on the basis of a sample is called:


The significance level is the risk of:
The probability of rejecting the null hypothesis when it is true is called:
After a clinical trial, it is concluded that both drugs A and B are equally effective. What type of a hypothesi
If a finding is statistically significant one must also interpret the data, calculate an effect size indicator

A passing student is failed by an examiner, it is an example of


A Type I error is also known as a
If you reject H0 but H0 is true, what type of error has occurred
Which variable represents the actual Type I error?
Type I error occurs when an innocent person is found guilty

TCS INTERNAL USE


purpose of student's t-distributions
The shape of the t-distribution depends upon the:

not the purpose of using chi-square distributions

characteristic of F-distribution curve?

A post hoc test is:


2 or more group means
The ANOVA is a statistical test that is used to compare how many group means?

The dividing point between the region where the null hypothesis is rejected and the region where it is not
A failing student is passed by an examiner, it is an example of:
A good way to get a small standard error is to use a
The use of the laws of probability to make inferences and draw statistical conclusions about populations
Confidence interval become narrow by increasing the:
An advertising agency wants to test the hypothesis that the proportion of adults in a country who read a
In which examples could binomial distribution be used?
The analysis of variance is a statistical test that is used to compare how many group means?
Identify the variables that are continuous or discrete
Which of the following is not the purpose of using chi-square distributions?
There may be times when data is supposed to fit a normal distribution, but does not. Which of the follow
Which of the following conditions are satisfied by Poisson random variable
A statistician calculates a 95% confidence interval for Mean when Standard Deviation is known. The conf
An essential component of the Central Limit Theorem is that
Identify which of the following steps would not be included in hypothesis
A door alarm works in 72 out of 100 cases and surveillance camera works in 68 out of 100 cases. What is
Any hypothesis which is tested for the purpose of rejection under the assumption that it is true is called?
The p-value in statistical significance testing should be used to assess how strong a relationship is. For e

TCS INTERNAL USE


Stander D
descriptive
Mean
1365
120
P = (8/28)(7/27)(6/26)*(5/25)

0.05
0.14

All
0

Time and Weight

Otlier and Small


Mean lies center
Peak is lower

All
Probability

Number of accicdnet
Number success - inDepen

Statistical hypothesis
Rejecting Ho when Ho is correct
Level of significance
Null
1

Type-II error
false positive
Type I
P value
1

TCS INTERNAL USE


When Modling
Degree of freedom

To test how closly

The curve is left sq

Follow up to ANOVA
one way
two or more

Critical value
Type-I error
Large Sample
Inferential statistics
Sample Size
Equal to 25%
Modelling the number of failures in a trial
Two or more
Time and Waigth
To test how closly
Otlier and Small
Number dep Probability of an event in an interval is unpredictable
. 20000
ALL
Eliminate all
0.72 0.49
Null
0

TCS INTERNAL USE


https://play.fresco.me/course/30

Which of the learning methodology applies conditional probabiliSupervised Learning


If you have a basket of different fruit varieties with some prior Supervised Learning

Do you think heuristic for rule learning and heuristics for decis 0
Now Can you make quick guess where Decision tree will fall int Supervised Learning

What is the benefit of Naïve Bayes ? Requires less training data


What is the advantage of using an iterative algorithm like gradieFor Nonlinear regression problems, there is no closed form sol
For which one of these relationships could we use a regression Relationship between Height & weight (both Quantitative)
Does Logistic regression check for the linear relationship bet 0
Which helps SVM to implement the algorithm in high dimensionKernal
Kernel methods can be used for supervised and unsupervised 1
Perceptron is _ a single layer feed-forward neural network
While running the same algorithm multiple times, which algori Hierarchical clustering

Which technique implicitly defines the class of possible pattern Kernel


If the outcome is continuous, which model to be applied? Linear Regression
Which of them, best represents the property of Kernel? Modularity
SVM will not perform well with large data set because (select t training time is high
What are different types of Supervised learning regression and classification
The model in which one estimates the probability that the outcoLogistic Regression
What are the advantages of neural networks (i) ability to learn byALL
One has to run through ALL the samples in your training set to do Gradient Descent
The standard approach to supervised learning is to split the set 1
SVM uses which method for pattern analysis in High dimensionaKernel
Which type of the clustering could handle Big Data? K Means clustering can
The correlation between two variables is given by r = 0.0. . Thi The best straight line would go through the horizontal
The main difficulty with using a regression line to analyze these presence
presence of
of 1
1 or
or more
more outliers
outliers
The main problem with using single regression line Estimate whether the association is linear or non-linear
Consider a regression equation, Now which of the following co
Which model helps SVM to implement the algorithm in high di KernelRFM Analysis
Which of the following is not example of Clustering?
SVM will not perform well with data with more noise because (starget classes could overlap
Objective of unsupervised data covers all these aspect except low-dimensional representations of the data
If the outcome is binary(0/1), which model to be applied? Logistic Regression
Correlation and regression are concerned with the relationship 2 quantitative variables
In Kernel trick method, We do not need the coordinates of the d 1
Which methodology works with clear margins of separation poi Support Vector Machine
In a scenario, where the statistical model describes random err overfitting
The model which is widely used for the classification is Logistic Regression
Which clustering technique requires prior knowledge of the num K Means clustering
Most famous technique used in Text mining is Naive Bayes
Disadvantage of Neural network according to your purview is More nodes to be defined

TCS INTERNAL USE


ems, there is no closed form solution
weight (both Quantitative)

ral network

through the horizontal

on is linear or non-linear

ns of the data

TCS INTERNAL USE


https://play.fresco.me/course/3
Involves working with known information
Which of the following is not applicable to
The process of extracting valid, useful, un Data mining
Statisticians
Which of the following role is responsible f
Which of the following activities is perform Detect Missing Values
Which of the following modelling type shouPredictive Modelling
Noisy values are the values that are valid f 1
What is the other name for Data Preparati ETL

Which statistical technique deals with findi Clustering


If time is used as an independent variable i Successive observations of the dependent variable are uncorrelated
Statistical technique used for investigatin Regression analysis
What is the type of learning where a functi Unsupervised
Decision Tree Learning
Probability of theft in an area is 0.03 wit

Confidence limits
_ are the values that mark the boundaries o
Regression is typically carried out to deve 1
Which of the following activities are perfo ALL
Which of the following are Multi-class Clas Is this movie a comedy, a documentary, or a thriller?
Machine learning task of inferring a functioSupervised Learning
Which is the statistical technique used for Regression analysis
Simulations are carried out to develop a m Clustering 0
Which data mining method groups togetherAffinity
ob analysis
Associate rule is known as _

TCS INTERNAL USE


are uncorrelated

TCS INTERNAL USE


https://play.fresco.me/course/35

What is a preferred distance measure while dealing with sets ?


Unsupervised learning focuses on understanding the data and its underlying pattern 1
Members of the same cluster are far away / distant from each other . 0
Which learning is the method of finding structure in the data without labels. Unsupervised
Each point is a cluster in itself. We then combine the two nearest clusters into one. What type ofAgglomerative
of a set of points is defined using a distance measure .

___ of two points is the average of the two points in Eucledian Space. centroid
The ______ is a visual representation of how the data points are merged to form clusters. Dendo
O(N3)
is the data point that is closest to the other point in the cluster. centroid
___ measures the goodness of a cluster Cohesion
A centroid is a valid point in a non-Eucledian space . 0

K Means algorithm assumes Eucledian Space/Distance 1


___ is a way of finding the k value for k means clustering.
Sampling is one technique to pick the initial k points in K Means Clustering 1
_ is when points don't move between clusters and centroids stabilize. Convergence
Hierarchical Clustering is a suggested approach for Large Data Sets 0
The number of rounds for convergence in k means clustering can be lage 1

What is the R Function to divide a dataset into k clusters ? kmeans()


What is the R function to apply hierarchical clustering to a matrix of distance objects ?
hclust()

TCS INTERNAL USE


Unsupervised
Agglomerative

Convergence

TCS INTERNAL USE


https://play.fresco.me/course/198

The objective function for linear regression is also known as Cost Function. 1
Output variables are known as Feature Variables . fasle
For different parameters of the hypothesis function we get the same hypothesis functfasle
What is the Learning Technique in which the right answer is given for each example inSuper
Hypothesis Function
What is the name of the function that takes the input and maps it to the output variab
Cost function in linear regression is also called squared error function. 1
How are the parameters updates during Gradient Descent Process ? simultentialy
Input variables are also known as feature variables 1
Problems that predict real values outputs are called ? Regression Problem
What is the process of dividing each feature by its range called ? Feature Scaling

function is used as a mapping function for classification problem. Sigmoid


Overfit data has a high bias 0
Where does the sigmoid function asymptote ? 0 AND 1
Lower Decision boundary leads to False Positives during classification 1
Linear Regression is an optimal function that can be used for classification problems. 0
Reducing the number of features can reduce overfitting 1
Classification problems with just two classes are called Binary classification problems. 1
is the line that separates y = 0 and y = 1 in a logistic function. Decision Boundary
What is the range of the output values for a sigmoid function ? [0,1]
Underfit Data has a high variance 1
Overfiting and Underfitting are applicable only to linear regression problems 0
High values of threshold are good for the classification problem. 0
A suggested approach for evaluating the hypothesis is to split the data into training an 1
For the error is determined by getting the proportion of values miss-classified by the Classification
For _ the error is calculated by finding the sum of squared distance between actual a Regression

Problems where discrete valued outputs predicted are called ? Classification


Problems
I have a scenario where my hypothesis fits my training set well but fails to generalize fOverfitting

controls the magnitude of a step taken during Gradient Descent Learning Rate
Mean Normalization
What is the process of subtracting the mean of each variable from its variable called ?
So when a ML Model has high bias, getting more training data will help in improving 0
For an underfit data set the training and the cross validation error will be high 0
For an overfit data set the cross validation error will be much bigger than the training 1
What measures the extent to which the predictions change between various realizati variance
What measure how far the predictions are from the actual values ? Bias
Problems that predict real values outputs are called Regression Problem

TCS INTERNAL USE


hesis Function

sion Problem

on Boundary

Normalization

sion Problem

TCS INTERNAL USE


https://play.fresco.me/course/252

Perform a daily resample/upsample of the data. Do a forward filling of the missing values with limit of 2. What is the value for
16,2011?
How many observations have you seen from Jan 1, 2011 to March 31, 2011?
Perform a monthly resample/downsample of the time series. What is the minimum value for May?
Perform a daily resample/upsample of the data. Do interpolation to fill the data. What is the value for Jan 12, 2011?
Perform a monthly resample/downsample of the time series. What is the maximum value for February?

For the WMT stock open prices time series perform a stationarity test using ADF. What is the p-value?
For the WMT stock open prices time series perform a stationarity test using ADF. What is the value of ADF statistic?
For the XOM stock close prices time series perform a stationarity test using ADF. What is the p-value?
For the XOM stock close prices time series perform a stationarity test using ADF. What is the value of the ADF statistic ?
For the WMT stock open prices time series perform a stationarity test using ADF. How is the time series behaving?

When I upsample my time series and I find many missing values , how do I fill the missing values?
What is the default aggregation function while resampling a time series in pandas?
It is a good practice to apply Forecasting models for non-stationary time series
If the mean and variance of a Time Series is constant over time , it is called a _ Time Series.
I cannot plot resampled Time Series data in Python.
If the p value is > 0.05 during the ADF test of the time series then the series is said to be _
Augmented Dickey-Fuller test cannot be used for identifying if a Time Series is Stationary.
Down sampling is the process of converting a to ____ frequency.
What package in Python provides features to work with Time Zones?
What does freq='T' signify while passing this parameter to the date_range() function ?
In Time Series data , the observations are captured over varying time intervals.
What function in Python helps in creating Date Time index for data that does not have date or time values captured?
What is the function to plot a lag plot for a time series using python?
What is the function to offset the date for daylight saving?
What is the function used for plotting the values of a time series using Python?
In pandas I can combine two time series with different frequencies into a single time series.
I can write my custom aggregation function while resampling my time series in Pandas.
Time Series data is indexed by _?
Auto Correlation Function Plot can be used for determining if a Time Series is stationary or not.
AIC stands for?

TCS INTERNAL USE


15.97
12
16.26
16.09
17.37

0.0028
-3.89
0.99
0.91
It is stationa It is stationary

all
mean()
1
Stationary
1
Non-Stationary
1
low,low
pytz
not create any tmestamp
0
to_datetime()
lag_plot()
Date()
plot.ts()
1
1 0
Datetime Index
1
Akaike information criterion
14

TCS INTERNAL USE


https://play.fresco.me/course/251

Identify the unstructured data from the following both image and video
Which preprocessing technique is used for dimensinality svd

Technique used to evaluate a classifier by dividing the data


cross validation
True Negative is when The predicted instance and the actual instance are negative
True Positive is when The predicted instance and the actual instance are not negative

Which classification techniques involves finding the eig PCA


What is the function that converts K-dimensional vector cSOFTMAX

HOG refers to _ Histogram of Oriented


Gradient
Which of the following is not a characteristics of HOG? Compute gradients in the region are to be described
Which of the following is a feature extraction technique?all
The first layer in a CNN is never a Convolutional Layer 0 1
Which of the following is not an example of CNN archite inseption

The process of changing the pixel intensity values to ach


Image normalisation
Higher value of which of the following hyperparameters is better for decision tree algorithm?

Clustering is a supervised classification 0


SIFT computes the gradient histogram only for patches w 1
The improvement of the image data that suppresses distoall
SVM is a __. supervised learning algorithm
Classification where each data is mapped to more than one Multi Class Classification
The fit(X, y) is used to Train the classifier
The scale-invariant feature transform can be used to dete 1
Choose the correct sequence for classifier building from
Which of the following is not a performance evaluation
SIFT stands for __/ scale-invariant feature transform
High classification accuracy always indicates a good classi 0
The variation present in the PCs decrease as we move fro 1
Which of the following is not a characteristics of HOG Compute
n x n orthonormal matrix region are to be described
gradients in the
In SVD, the matrix A of dimension m x n can be decompos
Which of the following is not a preprocessing technique None
Pruning is a technique associated with Decision
Principal tree
component
The normalized linear combination of the original predictor
Model Tuning helps to increase the accuracy 1 0

Select the correct statements about Nonlinear classificat Kernel tricks are used by Nonlinear classifiers to achieve maximum-m
TF-IDF is a common methodology used in pre-processing 1
The dimensionality reduction technique that efficiently r all
GradientDescent is one of Backward propagation techniqu 1

TCS INTERNAL USE


Which algorithm can be used for matching local regions HOG
HOG is simplified version of SIFT 1
Select the correct option that directly achieves multi-classDecision Tree

TCS INTERNAL USE


instance are negative
instance are not negative

o be described

o be described

ssifiers to achieve maximum-margin hyperplanes

TCS INTERNAL USE


https://play.fresco.me/course/259

An Ideal ACF plot will increase exponentially ? 0


What is the term used to quantify the relationship between the cauto corel
The Partial Auto Correlation Function is useful in detecting the O 1
Non Stationary Time series will have a declining effect ? 0
Partial Auto Correlation is also known as _ Conditional Correlation
What is the property of White Noise in an Auto Regression EquatIt has zero mean and unit standard deviation.
You take a time series and determine the characteristic equation.
What is the process followed to make a specific metric unitless Normalization
What do you get when you divide Auto Covariace of a Time Series Auto Correlation Function
The Auto Correlation Function is Unitless. 1
__ is said to occur when the variance of the unobservable error , Heteroskedasticity
What is the outcome of Model Fitting process for Auto RegressioDetermining the Coefficient
The Auto Regression model can be represented as a Moving Aver 1
What can we say about the time series when the inverse of the l

In a time series , the rate of decay will decide the value of the co Parsimonious
1 Model
A model that is efficient and simple is known as ? Moving Average Model
Time series is a linear combination of white noise process. This i
If there is no decay in the ACF values for any number of lags , whstationary
Exponential smoothing models can be considered as ARIMA mod 1
The coefficient for the residual error terms can be negative for a 1
ARIMA (1,0,0) is equivalent to AR Model
How will you make a non-stationary time series to stationary ? This conversi
For a moving average model , the expectation of the dependent va constant
For an ARMA process ____ ACF and PACF show a geometrically declining trend
In an ARMA(p,q) series , what do p and q represent ? Lag Terms
ARIMA (0,1,0) is equivalent to Random WalMA Model ARIMA (0,1,0) is equivalent
My time series model is predicting well for the available data b Over-fitting
ARIMA (1,0,1) is equivalent to _ ARMA Model
What is the mechanism used to choose optimal p and q for an
For MA(2) process , the Auto Correlation value for the first two l 1
is a form of Auto Regressive model where the coefficient of first la
AR , MA and ARMA models can handle non-stationary time seriesOLS and ML0
What methodology can be used for Estimating the coefficients in
If the ACF follows a geometric decay and the PACF is significant tiA R(p)Series and Independent Variable
Time
Multivariate time series includes lag of _ and _ 1 variable AR(p) process to n variables
What does a p order vector auto regressive process generalize ?
ARIMA (0,0,1) is equivalent to _ MA Model
white noise innovations
What are serially uncorrelated vectors which have variance betwe
What is the range of smoothing constant alpha ? 0 to 1
In vector auto regression , the estimation by ordinary least squar 1
Structural Models have a time component. 1
The coefficient for the residual error terms can be negative for a 1

TCS INTERNAL USE


In a time series , the near terms have a larger effect to the curr 1
In a time series , the rate of decay will decide the value of the co 0
_ is a multivariate generalization of a uni-variate auto regressiv Vector Auto Regression

In exponential smoothing , the weights assigned to lag values sh


Decline

TCS INTERNAL USE


Advanced Time Series Analysis

eclining trend

) is equivalent

TCS INTERNAL USE


https://play.fresco.me/course/239
Both Image and Video clip
Identify the unstructured data from the following
What kind of classification is our case study 'Spam DetectBinary
Which preprocessing technique is used to remove the Stopword removal
Cross-validation technique is used to evaluate a classifier 1

True Positive is when the predicted instance and the actua 1


True Negative is when the predicted instance and the actu 0

Select the correct statements about Nonlinear classificat kernel trick is used in SVM for non-linear classification
What is the tokenized output of the sentence “if you cannOnly', 'do', 'what', 'your', 'heart', 'tells', 'you'
Classification where each data is mapped to more than one Multi-class
The most widely used package for machine learning in pysklearn
In a Term Document Matrix (TDM) each row represents ?document
Cross-validation causes over-fitting 1
In Supervised learning, class labels of the training sampleunkown
TF-IDF is a freature extraction technique 1
Which one of the following is not a classification techniq StratifiedShuffleSplit
counts of unique values in the 'label' column
What does the command sentiment_analysis_data['label']
Inverse Document frequency is used in term document m 1
SVM is supervised learning algorithm.
Which NLP technique uses lexical knowledge base to obtalemmatization
To view the first 3 rows of the dataset, which of the fo sentimentanalysisdata.head(3)
Labeled data
Supervised learning differs from unsupervised learning in
Which numerical statistics is used to identify the import tf-idf
An algorithm that counts how many times a word appearBag-of-words(BOW)

Which of the given hyper parameter(s), when increased m Depth of


Tree
TF and IDF use matrix representations 1
Higher value of which of the following hyperparameters iscan't say
What command should be given to tokenize a sentence i
High classification accuracy always indicates a good classi 0
What is the purpose of lemmatization? To convert words to a proper base form
Which of the following is not a preprocessing method useconfusion_matrix
In a Document Term Matrix (DTM) each row represents ?document
SVM is a
email spam data is an example of Unstructure
Identify the stop words from the following Both "the" and "it"
What kind of classification is the given case study(SentimMulti class classification

TCS INTERNAL USE


['Good',
What is the output of the sentence “Good words bring go'word',
'bring',
'good',
'feeling',
'to', 'the',
'heart']
Lemmatization offers better precision than stemming 1
Clustering is a supervised classification Supervior Learning
In document classification, each document has to be cove 1

TCS INTERNAL USE


ear classification

TCS INTERNAL USE


https://play.fresco.me/course/225

Identify the structured data from the following

What kind of classification is our case study 'Churn Analysis'?


Which command is used to identify the unique values of a column?

Which preprocessing technique is used to make the data gaussian with zero mean and unit variance?

True Negative is when the predicted instance and the actual is positive.
True Positive is when the predicted instance and the actual instance is not negative.
Cross-validation technique is used to evaluate a classifier by dividing the data set into training set to train the classifier and tes

first 3 rows of the dataset


A technique used to depict the performance in a tabular form that has 2 dimensions namely “actual” and “predicted” sets of d
Let's assume, you are solving a classification problem with highly imbalanced class. The majority class is observed 99% of time
Pruning is a technique associated with
Which type of cross validation is used for imbalanced dataset?
Choose the correct sequence for classifier building from the following:
How many new columns does the following command return?

iris_series = pd.get_dummies(iris['Species'])

Download the dataset from: https://gist.githubusercontent.com/curran/a08a1080b88344b0c8a7/raw/


d546eaee765268bf2f487608c537c05e22e4b221/iris.csv to answer the question.

Model Tuning helps to increase the accuracy

The number of categorical attributes in the original dataset.

Download the dataset from: https://gist.githubusercontent.com/curran/a08a1080b88344b0c8a7/raw/


d546eaee765268bf2f487608c537c05e22e4b221/iris.csv to answer the question.
Identify the command used to view the dataset SIZE and what is the value returned?

Is there a class imbalance problem in the given data set?

The commonly used package for machine learning in python is

High classification accuracy always indicates a good classifier


Images,documents are examples of
A classifer that can compute using numeric as well as categorical values is
classification where each data is mapped to more than one class is called
What are the advantages of Naive Bayes?
What kind of classification is the given case study(IRIS dataset)?
The fit(X, y) is used to

TCS INTERNAL USE


email spam detection is an example of
Select pre-processing techniques from the options (check box)
Can we consider sentiment classification as a text classification problem?
A process used to identify data points that are simply unusual
Cross-validation causes over-fitting.

select all NUMERIC types in the dataset.

How many classes will the following command return(target classes in the dataset) : classes=list(iris['species'].unique())
Ensemble learning is used when you build component classifiers that are more accurate and independent from each other.

Which classifier converges easily with less training data?

Cross-validation technique will provide accurate results when the training set and the testing set are from two different popul

Choose the correct sequence from the following:

Imagine you have just finished training a decision tree for spam classication and it is showing abnormal bad performance on b
The number of categorical attributes in the original dataset.
What does the command iris['species'].value_counts() return?
Ordinal variables has
clustering is an example
Which of the following is not a technique to process missing values?
Naive Bayes Algorithm is useful for :

TCS INTERNAL USE


Data from mySQL DB and Excel

Binary
Unique()

Standardization

fasle
1
1

iris.top(3)
Confusion Matrix
For imbalanced class problems, accuracy metric is not a good idea.
Decision tree
K -Fold wrong
Initialize -> Train - -> Predict-->Evaluate

iris.size,(150,5)
No

sklearn
0
Unstructure data
Decision tree wrong
Multi Class Classification
It will converge quicker than discriminative models like logistic regression AND it requires less training data
Multi Class Classification
Train the classifier

TCS INTERNAL USE


Super
All
No
Anomaly Detection
1 wrong

iris_num = iris_data.select_dtypes(include=[numpy.number])
3
1
Naive Bayes Classifier

1 0

Data Analysis -> PreProcessing -> Model Building--> Predict


You need to increase the learning rate galat
1
The total count of elements in iris['species'] column
Ordinal variables has
unsupervised
Imputing classification

None

TCS INTERNAL USE


raining data

TCS INTERNAL USE


https://play.fresco.me/course/169
Bugs and defects no longer occur
Which is NOT a benefit of CI ?
Git is a _ version control system. Distributed
Activities that are part of continuous integration all
Deploy to production
Which is not a CI practice ? Work on user stories
Feature branching is used to Involves moving code in large amounts
Which is NOT true about continuous integration ?
Work Branch is also known as
Release branches are created for resolving merge conflicts 0

Time taken to fix a broken build is measured using __. Build repair rate
Private builds are executed after moving the changes to v On-demand0
Release builds can be triggered NAnt
__ is a .Net build tool. Build scalability
Capablility of your build system to handle an increase in Cyclomatic Complexity Number
Complexity of a code is determined based on ___.
Pipeline break and build break is one and the same 0
_ is a measure of incoming dependencies. Afferent Coupling
Build can be triggered by a version control tool. 1

Feature toggle can be used in the following scenarios . all

Which of these is not a valid build trigger ? version


downstreamconrol
trigger
trigger
Code changes can be hidden,enabled or disabled using all
Private builds can be triggered automatically. 0

Which of the tools is not used for establishing a pipeline

Teamcity Jenkins
Travis CI
CI pipeline consists of stages
Which is the first code analysis to be executed ? unit test
Code Complexity
CCMetrics is used to identify
Commiting a code change, when the inspection fails , is per 0
Teamcity is build tool
Efferent Coupling is a measure of Outgoing dependencies
Select a CI tool based on it's both
is a code coverage tool Nunit
Which of the tools is not mandated for integration in a CI Terraform

TCS INTERNAL USE


Git, Mercurial and Subversion are centralized version cont 0
Which of the following is an artifact repository Nexus
Staged builds include both
What are the reasons for a build to fail ? both
Trunk is also known as Mainline

Arrange in order- A. Running Unit Test B.Running Static BACD

Faster feedback can be received by

Commiting
changes
Staging frequently
Commiting changes
builds frequently
helps in differentiating the environments. Configuration file
Jabber is a Messaging plugin
In a normal scenario, software from is deployed to producMaster branch

How are component dependencies injected in a workflow Parallel Run unit test and build package
trigger
component
s and build
Build packagpackage
Which of these is not mandated for a CI pipeline ? Infrastructu ConfiguratioApplication code
It is a proven fact that reducing the CCN of code is known 1
Does CI pipeline need to have all the software development Cyclomatic1Complexity 0Number
Control flow graph is used to calculate _.

is a continuous integration server.


Teamcity
Ansible

In a normal scenario, software from is deployed to producMaster branch


Component test is executed as part of _. Private Buildall

TCS INTERNAL USE


and build package

TCS INTERNAL USE


https://play.fresco.me/course/32

Based on the hands on card “ OLS in Python Statsmodels” What is the adjusted R sq value
Based on the hands on card “ OLS in Python Statsmodels”What is the value of the constant term ?
Based on the hands on card “ OLS in Python Statsmodels” What is the value of the estimated coef for variable
Based on the hands on card “ OLS in Python Statsmodels” What is the value of R sq ?
Based on the hands on card “ OLS in Python Statsmodels” How many observations are there in the dataset ?

Based on the hands on card “MLR Hands On” What is the P>|t| value for the 'INDUS' variable ?
Based on the hands on card “MLR Hands On” What is the standard error for the constant term ?
Based on the hands on card “MLR Hands On” What is the value of the estimated coef for the constant term ?
Based on the hands on card “MLR Hands On” what is the value of R sq ?
Based on the hands on card “MLR Hands On” Perform a correlation among all the independent variables . What

It is OK to discard theoretical considerations for Statistical Measures


Arithmetic Mean can be used as a prediction measure.
It is advised to go for a simpler model while fitting multiple regression for a dataset
In Multi Variable regression you predict one variable using more than one variable
The SSE depends on the number of observations in the data set
SSE is _ for the Line of Best Fit and _ for the baseline model
What is the formula for root means square error ?
Sum of Squared error is a measure of standard for a Regression Line
By adding multiple variables in Multi Variable Regression , the model accuracy
Regression can show causal relationship between two variables.
pr(>|t|) term signifies how likely the estimated value is zero
What is the good range of correlation values to include in the regression model
__ means predicting one variable from another.
What is the term that represents the difference between actual and predicted value called ?
When two or more variables are correlated in a Multiple Regression Model , it is called as
What is the basic property of the model of best fit ?
What is the process of rescaling the values in the range [0,1]
What is the sum of standard error for the baseline model ?
When more variables are added in Multi Variable Regression the marginal improvement decreases as each variab
What is the process of removing the mean and dividing the value by the standard deviation
It is advised to omit a term that is highly correlated with another while fitting a Multiple Regression Model
R Square Value can be greater than zero
What is the quantity that measures the strength of relationship between two variables ?
__ is a unit less quantity

TCS INTERNAL USE


0.483
9.1021 -34

0.484
506

0.731
5.104
36.42
0.741
-0.76923

11
0 1
1
1
0 1
0
Small , Small
sqrt(SSE/n)
1
Increases
0
1
-1 to +1
Correlate
Differnece
Multi Collinearity
minimize error
normalization
SSE
Law of Incremental Returns
STANDERLIZATION
1
1
Correlation
SSE R

TCS INTERNAL USE


https://play.fresco.me/course/116

Name the color scheme you get by mixing different tones, shades and tints within a specific hue
What is the white space located inside letters like o and p called?
Name the design concept of making items represented, resemble their real-world counterparts
Which one below is a vector image format?
What do you get when you add black to a hue (color)?
The height of a capital letter measured from the baseline
What’s the measure of purity of the color?
Name the design tool that can be used to establish a spatial hierarchy of the content
Name the design language system developed by google
What’s the space between letters called?
The name of the invisible line marking the height of a font.

TCS INTERNAL USE


Monochromatic

Counter
Skeuomorphism

svg
shade
Cap height
Chroma

Material Design

Kerning
Baseline

TCS INTERNAL USE


https://play.fresco.me/course/90

DevOps encompasses culture and collaboration 1


Product Management is one of the 7 DevOps practices 0
Improve quality and performance of the applications is a majo 1
DevOps is the union of ___ , ____ and __ to enable continuousPeople, Processes and Products.
Understanding of what we are deploying and how we are depConfiguration Management
DevOps encompasses _ Automation
____________ is the primary measure of progress according tWorking Software

enables continuous value delivery Automation


What is the desire in the organization to push things to produ Live Site Culture
Automation empowers frequent releases and self-service 1
Implants Manual Errors
Which of the following is not a benefit of Automation in DevOTechnical Debt
Which of the following can slow down your ability to deliver co
____________ measures complexity of the code and the effic Change Lead Time
Which of the following is a benefit of small releases ? Faster Feedback
What is the metric used to measure how quickly are we deploDeployment Frequency
Minimize conflicts when merging code should be one of your 1
Change Fail rate is an Agility performance indicator 0
Feature Flags
_ ensure that code deployed to production environment is not
Continuous Delivery encourages configuration as code. Continuous1Delivery
_ is a software engineering approach in which teams produce so
Continuous Delivery focuses on manual delivery pipeline. 0
During the Continuous Integration process unit tests are execu 1

It is OK if all the environments are not consistent in a DevOps 0


_ is the automated system used for continuous integration (B Build Pipeline
______________ testing tests components together in scenariIntegration
In Manual Integration Testing Human Intelligence is waged aga 1
Treating Infrastructure as a flexible resource is a benefit of I 1

Which of the following is not a type of monitoring ? program


Which of the following is a value of DevOps DevOps goals span the entire delivery pipeline
DevOps is people following a __ enabled by products to delive
The process of monitoring and learning from a “live site” is k Live Monitoring

TCS INTERNAL USE


https://play.fresco.me/course/100

____________ testing was derived from hardware testing. smoke


Capacity testing _. is a critical testing and helps to verify technical aspects
A build can be triggered by a version control tool. Stream Based 1 Version Control
Accurev is a _ system
Collection of executable codes is known as Binnery
A build that fails __ stage will not be deployable. Acceptance
The continuous deployment must be followed to practice continuo 1
Large scale changes to an application can be achieved using __. Branch by Abstraction
Commit Stage
Release candidate is created after the change passes through the
Time taken to fix a broken build is measured using __. Build repair rate
Build tools can be categorised as ____. both
Continuous deployment aims at making feedback loops as short as 1
Build-time dependency should be present when an application is Directed acyclic
0 graph
Dependencies between components must be represented using .
New instance of ___ is created upon every check-in. all
People who do the deployment should be part of creating the de 1

User acceptance testing is done in the stage. Acceptence None


Response time of the requests are verified using ____.
Value delivered by an application is validated through Showcases
Chef is an orchestration tool. 1
User acceptance testing is a verification activity 0
Only positive scenarios are validated in ___ testing. Dependency Hell
Famous problem of dependency management is known as ___. both Dll Hell
___ are known as hot-deployments. zerodown time
Feature toggles is possible in version control systems. 1
Which of the activity/activities are part of continuous integration?all
Automated implementation of an application's build, test, and dep
_ is a paradigm to move code from check-in to production in a co both
Functional testing is a verification activity. 0 1
_ is a central part of continuous deployment. Deployment Pipeline
Environment constitutes all
The capability of your build system to handle an increase in the a Build scalability
Testing done in a customer's environment is know as . beta
In branch by abstraction model, features are developed in Feature Branch
Component is separated from codebase when all
It is advised to perform manual testing throughout the project ph 1
Configuration information can be supplied using . all
_ stage asserts that the system works at the technical level. commit
Number of transactions a system can process in a given timespan,Throughput
A build is triggered whenever there is a change in the _. Downstreamboth

TCS INTERNAL USE


Static code analysis is performed in the _. Commit stage
An environment that is in a properly deployed state is known as Mainline Baseline
The deployment pipeline is initiated by the creation of a release c 0

___ stage takes the delivery team beyond continuous integration.Acceptance


Emergency fixes must go through the deployment pipeline. 1
Disciplined technique for restructuring a code without modifying itRefactoring

Deployment pipeline is an extension of .


Continuous
Integration
MTTR is ____. Mean Time to Repair failures.
is a non-functional test. capacity
Continuous deployment aims at making feedback loops as short as 1
What are the principle(s) that are basic for an effective configu both

TCS INTERNAL USE


lps to verify technical aspects

TCS INTERNAL USE

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy