Use Advanced SQL
Use Advanced SQL
Use Advanced SQL
Enable portability
1
Application portability
Application longevity
Reduced dependence on a single vendor
Cross-system communication
SQL Environment
Catalog: a set of schemas that constitute the description of a database.
Schema: the structure that contains descriptions of objects created by a user (base tables, views, constraints …)
DDL: commands that define a database, including creating, altering, and dropping tables and establishing constraints
DML: commands that maintain and query a database.
ACL (Data Control Language): commands that control a database, including administering privileges and
committing data.
Numeric types
- Number(p,q) – general purpose numeric data type
- Integer(p) – signed integer, p digits wide
- Float(p) - floating point in scientific notation with p binary.
2
- Date/Time type - Date - fixed length date/time in dd-mm-yyyy form
3
Data Integrity Controls
Data integrity controls refer to the consistency and accuracy of data that is stored in a database.
Data Integrity validates the data before getting stored in the columns of the table.
Types of data integrity:
- Domain Integrity
- Entity Integrity
- Referential Integrity
Each type of data integrity – domain, Entity, and referential – is enforced with separate types of constraints.
Domain Integrity:
Domain integrity is the validity of entries for a given column.
You can enforce domain integrity by restricting the type (through data types), the format (through CHECK
constraints and rules), or the range of possible values (through FOREIGN KEY constraints, CHECK constraints,
DEFAULT definitions, NOT NULL definitions, and rules).
Entity Integrity:
Entity integrity defines a row as a unique entity for a particular table. We can enforce the Entity Integrity through the
indexes, UNIQUE constraints, PRIMARY KEY constraints, or IDENTITY properties
4
Referential Integrity
Referential integrity maintains the defined relationships between tables when records are entered or deleted.
When you enforce referential integrity, SQL Server prevents users from:
- Adding records to a related table if there is no associated record in the primary table.
- Changing values in a primary table that result in orphaned records in a related table.
- Deleting records from a primary table if there are matching related records.
5
SELECT clause INSERT Statement
6
Insert statement: - Insert statement is used to add/insert new records or from another tables to a particular table.
Delete statement: - Remove/delete row/s from a table.
Update statement: - Modifies data in existing rows
Select statement: - Used to queries from a single or multiple tables.
- Select - lists the column/s that should be returned from the query.
- From - indicate the table/s or view/s from which data will be obtained
- Where - indicate the conditions under which row/s will be included in the result.
- Group by – indicate categorization of results.
- Having – indicate the conditions under which a category (group) will be included.
- Order by – sorts the result according to specified criteria.
Use Aggregate function: - Use aggregate function such as sum, average, product, count, max, min, and so on.
Use Boolean operators: - Use AND, OR, and NOT operators for customizing conditions in where clause.
The SELECT and FROM clauses must be specified. The other clauses are optional.
With the SELECT clause, you specify the name of each column you want to retrieve.
The FROM clause specifies the table that you want to select data from. You can select columns from more than
one table.
You can specify that only one column be retrieved or as many as 8000 columns. The value of each column you
name is retrieved in the order specified in the SELECT clause.
If you want to retrieve all columns (in the same order as they appear in the table's definition), use an asterisk (*)
instead of naming the columns.
Simple selects
The simple select statement select clause which used to specify the columns you want to retrieve, from clause
which used to specify the tables from which the columns are to be retrieved, and the where clause which used to
limits the rows returned by your query.
Here is the basic syntax: SELECT <column_list> FROM <table_list> <[WHERE <search_criteria>]>
8
An Inner Join returns only the rows from each table that have matching values in the join columns.
Any rows that do not have a match between the tables will not appear in the result table.
A Left Outer Join returns values for all of the rows from the first table (the table on the left) and the
values from the second table for the rows that match. Any rows that do not have a match in the second table
will return the null value for all columns from the second table.
A Right Outer Join return values for all of the rows from the second table (the table on the right) and the
values from the first table for the rows that match. Any rows that do not have a match in the first table will
return the null value for all columns from the first table.
A Left Exception Join returns only the rows from the left table that do not have a match in the right table.
Columns in the result table that come from the right table have the null value.
A Right Exception Join returns only the rows from the right table that do not have a match in the left
table. Columns in the result table that come from the left table have the null value.
A Cross Join returns a row in the result table for each combination of rows from the tables being joined (a
Cartesian product).
Subqueries:
A Subquery or Inner query or Nested query is a query within another SQL query, and embedded within the
WHERE clause.
- You use subqueries to break down a complex query into a series of logical steps and, as a result, to solve a
problem with single statements.
- Each select statement in the subquery has its own:
. select list
. where clause
A subquery is used to return data that will be used in the main query as a condition to further restrict the data to be
retrieved.
Subqueries can be used with the SELECT, INSERT, UPDATE, and DELETE statements (or inside another
subquery) along with the operators like =, <, >, >=, <=, IN, BETWEEN etc.
There are a few rules that subqueries must follow:
Subqueries must be enclosed within parentheses.
A subquery can have only one column in the SELECT clause, unless multiple columns are in the main
query for the subquery to compare its selected columns.
9
An ORDER BY cannot be used in a subquery, although the main query can use an ORDER BY. The
GROUP BY can be used to perform the same function as the ORDER BY in a subquery.
Subqueries that return more than one row can only be used with multiple value operators, such as the IN
operator.
The BETWEEN operator cannot be used with a subquery; however, the BETWEEN can be used within the
subquery.
Example: For each department that has more than five employees, retrieve the department number and the number
of its employees who are making more than $40,000.
SELECT DNUMBER, COUNT (*) FROM DEPARTMENT, EMPLOYEE
WHERE DNUMBER=DNO AND SALARY>40000 AND
DNO IN (SELECT DNO FROM EMPLOYEE
GROUP BY DNO
HAVING COUNT (*) > 5)
GROUP BY DNUMBER
Union, Intersect, Except
With the help of set operators, the results of individual table expressions can be combined. This type of
combination is called UNION. SQL supports other set operators besides the UNION operator.
Here is the complete list:
UNION
UNION ALL
INTERSECT
INTERSECT ALL
EXCEPT
EXCEPT ALL
The UNION operator is used to combine the result-set of two or more SELECT statements.
- Each SELECT statement within the UNION must have the same number of columns and similar data types.
10
- Also, the columns in each SELECT statement must be in the same order.
Syntax: SELECT <column_name_list> FROM <table_name> UNION SELECT <column_name_list>
FROM <table_name>
Note: The UNION operator selects only distinct values by default. To allow duplicate values, use UNION ALL.
SELECT
if (selectField1 = true) Field1 ELSE do not select Field1
If the condition is true, statement1 and statement2 is executed; otherwise statement 3 and statement 4 is executed.
The expression or condition is any expression built using relational operators which either yields true or false
condition.
12
. When you want retrieving data using a SELECT statement, you can use logical operators in the WHERE
clause to combine more than one condition.
Example: Retrieve all students’ first name, last name, subject who are studying either ‘Maths’ or ‘Science’.
SELECT first_name, last_name,subject FROM student
WHERE subject = 'Maths' OR subject = 'Science'
floor() : The SQL FLOOR() rounded up any positive or negative decimal value down to the next least
integer
value. SQL DISTINCT along with the SQL FLOOR() function is used to retrieve only unique
value
after rounded down to the next least integer value depending on the column specified.
Example: SELECT floor(17.26)FROM TRAINEE
exp() : The SQL EXP() returns e raised to the n-th power(n is the numeric expression), where e is the base
of
natural algorithm and the value of e is approximately 2.71828183.
14
Example: SELECT EXP(2) AS e_to_2s_power FROM TRAINEE
power() : This SQL POWER() function returns the value of a number raised to another, where both of the
numbers are passed as arguments.
sqrt() : The SQL SQRT() returns the square root of given value in the argument.
SQL character or string function is a function which takes one or more characters as parameters and returns a
character value. Some Character functions are -
lower() : The SQL LOWER() function is used to convert all characters of a string to lower case.
upper(): The SQL UPPER() function is used to convert all characters of a string to uppercase.
As long as your data contains only the date portion, your queries will work as expected.
The following table lists the most important built-in date functions in SQL Server:
- GETDATE()
- DATEPART()
- DATEADD()
- DATEDIFF()
GETDATE() : The GETDATE() function returns the current date and time from the SQL Server.
Syntax: GETDATE()
15
Example: CREATE TABLE Orders(
OrderId int NOT NULL PRIMARY KEY,
ProductName varchar(50) NOT NULL,
OrderDate datetime NOT NULL DEFAULT GETDATE())
DATEPART() : The DATEPART() function is used to return a single part of a date/time, such as year,
month, day, hour, minute, etc.
Syntax: DATEPART(datepart,date)
Syntax: DATEADD(datepart,number,date)
DATEDIFF() : The DATEDIFF() function returns the time between two dates.
Syntax: DATEDIFF(datepart,startdate,enddate)
16
Example: Now we want to get the number of days between two dates. We use the following SELECT statement:
Result: 61
Now we want to get the number of days between two dates (notice that the second date is "earlier" than the first
date, and will result in a negative number). We use the following SELECT statement:
We will get no result! This is because the query is looking only for dates with no time portion.
Tip: If you want to keep your queries simple and easy to maintain, do not allow time components in your dates!
- With the exception of the COUNT (*) function, all aggregate functions return a NULL if no rows satisfy
the WHERE clause. The COUNT (*) function returns a value of zero if no rows satisfy the WHERE clause.
The MAX function Returns the maximum value from expression. Max ignores any NULL values.
Syntax: MAX ( [ALL | DISTINCT] <expression> )
Example: select MAX(Employee.Salary) from dbo. Employee
The MIN function Returns the smallest value from expression. Min ignores any NULL values.
Syntax: MIN ( [ALL | DISTINCT] <expression> )
Example: select MIN(Employee.Salary) from dbo. Employee
The SUM function Returns the total of all values in expression. Sum ignores any NULL values.
Syntax: SUM ( [ALL | DISTINCT] <expression> )
Example: select SUM(Employee.Salary) from dbo. Employee
18
The AVERAGE function Returns the average of the values in expression. The expression must contain
numeric values. Null values are ignored.
syntax: AVG ([ ALL | DISTINCT ] <expression>)
Example: select ID, avg(Employee.Salary) from dbo. Employee
- STDEV: Returns the standard deviation of all values in expression. Stdev ignores any NULL
values. Syntax: STDEV( <expression> )
Example: select STDEV(Employee.Salary) from dbo. Employee
- STDEVP: Returns the standard deviation for the population of all values in expression. Stdevp
ignores any NULL values.
Syntax: STDEVP( <expression> )
Example: select STDEVP(Employee.Salary) from dbo. Employee
- VAR: Returns the variance of all values in expression. Var ignores any NULL values.
syntax: VAR( <expression> )
Example: select VAR(Employee.Salary) from dbo. Employee
- VARP: Returns the variance for the population of all values in expression. Varp ignores any
NULL values. syntax: VARP( <expression> )
Example: select VARP(Employee.Salary) from dbo. Employee
LO3. Write advanced SQL statements that use aggregation and filtering
3.1. SQL - Grouping By Multiple Columns
When an aggregate function is executed, SQL Server summarizes values for an entire table or for groups of
columns within the table, producing a single value for each set of rows for the specified columns.
- You can use aggregate functions with the SELECT statement or in combination with the GROUP BY
clause
19
- Use the GROUP BY clause on columns or expression to organize rows into groups and to summarize those
groups. The GROUP BY clause groups rows on the basis of similarities between them.
When you use the GROUP BY clause, consider the following facts and guidelines:
. SQL Server returns only single rows for each group that you specify; it does not return detail information.
. All columns that are specified in the GROUP BY clause must be included in the select list.
. If you include a WHERE clause, SQL Server groups only the rows that satisfy the search conditions.
. Do not use the GROUP BY clause on columns that contain multiple null values.
Example1: For each department, retrieve the department name, the number of employees in the department, and
their average salary; catagorize by the TraineeID, Departement and name respectively.
SELECT DEPARTEMENT.Dname ,
COUNT (EMPLOYEE.ID)'number of employee',
AVG (EMPLOYEE.salary)'average salary'
FROM EMPLOYEE,DEPARTEMENT
where EMPLOYEE.dnum =DEPARTEMENT.Dnumber
GROUP BY DEPARTEMENT.Dname
Example2: Retrieve all trainees’ TraineeID, Name, Departement , the number of course taken by each trainee and
catagorize by the TraineeID, Departement and name.
20
SELECT <column_name1>, <column_name2> <aggregate_function(s)>
FROM <table_name>
GROUP BY <column_name1>, <column_name2>
1) Usually, a subquery should return only one record, but sometimes it can also return multiple records when used
with operators like IN, NOT IN in the where clause. The query would be like,
21
The output would be similar to:
2) Lets consider the student_details table which we have used earlier. If you know the name of the students who
are studying science subject, you can get their id's by using this query below,
but, if you do not know their names, then to get their id's you need to write the query in this manner,
Output:
id first_name
-------- -------------
100 Rahul
102 Stephen
In the above sql statement, first the inner query is processed first and then the outer query is processed.
3) Subquery can be used with INSERT statement to add rows of data from one or more tables to another table. Lets
try to group all the students who study Maths in a table 'maths_group'.
22
4)A subquery can be used in the SELECT statement as follows. Lets use the product and order_items table defined
in the sql_joins section.
select p.product_name, p.supplier_name, (select order_id from order_items where product_id = 101) as order_id
from product p where p.product_id = 101
Correlated Subquery
A query is called correlated subquery when both the inner query and the outer query are interdependent. For every
row processed by the inner query, the outer query is processed as well. The inner query depends on the outer query
before it can be processed.
NOTE:
1) You can nest as many queries you want but it is recommended not to nest more than 16 subqueries in oracle.
2) If a subquery is not dependent on the outer query it is called a non-correlated subquery.
SQL SubSelect-SubQueries
A sub query or sub select is a select statement that returns a single value
output result and it can be nested inside other subquery or any SELECT, INSERT,
DELETE OR UPDATE statement.
Example:
USE NORTHWIND
SELECT P.PRODUCTNAME,
(SELECT CATEGORYNAME FROM CATEGORIES WHERE CATEGORYID = P.CATEGORYID)
FROM PRODUCTS P
23
Subquery used in the above example returns the category name of each product
in every tuple.
Example:
Here subquery returns product id as single value to the main SQL query.
Example:
SELECT C.CATEGORYNAME,
SOME operators
SOME operator is the same as ANY operator (SOME and ANY perform the same function).
SOME compares a value to each value in a list or results from a query and evaluates to true if the result of an inner
query contains at least one row.
SOME must match at least one row in the subquery and must be preceded by comparison operators.
Example: SELECT ID,Fname,Lname FROM EMPLOYEE
WHERE ID =SOME(SELECT EmpID FROM DEPENDENTED
WHERE Gender='Female');
EXISTS operator
26
The EXIST operator checks the existence of a result of a subquery. The EXISTS subquery tests whether a
subquery fetches at least one row. When no data is returned then this operator returns 'FALSE'.
- A valid EXISTS subquery must contain an outer reference and it must be a correlated subquery.
- You can use the EXISTS and NON EXISTS operators to determine whether data exists in a list of values.
- Use the EXISTS and NOT EXISTS operators with correlated subqueries to restrict the result set of an outer
query to rows that satisfy the subquery.
having clause
Use the HAVING clause on columns or expressions to set conditions on the groups included in a result set.
The HAVING clause sets conditions on the GROUP BY clause in much the same way that the WHERE clauses
interacts with the SELECT statement.
Example: SELECT dnum ,count(*)'number of employee' FROM EMPLOYEE
WHERE exists (SELECT dname FROM DEPARTEMENT
WHERE EMPLOYEE.Dnum =DEPARTEMENT.Dnumber)
group by Dnum
having COUNT(*)>2
4.4. Correlated subqueries
In a SQL database query, a correlated sub-query (also known as a synchronized subquery) is a sub-query (a
query nested inside another query) that uses values from the outer query in its WHERE clause.
- Correlated subquery is one that is executed after the outer query is executed. So correlated subqueries take
an approach opposite to that of the normal subqueries.
27
- In a correlated subquery, the inner query uses information from the outer query and executes once for every
row in the outer query.
- A practical use of a correlated subquery is to transfer data from one table to another.
Syntax for correlated subquery: select column_list from table_name a
where search_condition (select column_list from table_name b
where a.column_name_a=b.column_name_b)
Example: find out the name of all EMPLOYEES who has less or equal to two dependent using correlated
subquery.
The SQL ORDER BY clause allows you to sort the records in your result set. The SQL ORDER BY clause can
only be used in SQL SELECT statements.
SELECT columns
FROM tables
WHERE predicates
ORDER BY column ASC/DESC;
The SQL ORDER BY clause sorts the result set based on the columns specified. If the ASC or DESC value is
omitted, it is sorted by ASC.
SQL ORDER BY
The ORDER BY clause is used in a SELECT statement to sort results either in ascending or descending order.
Oracle sorts query results in ascending order by default.
28
Syntax for using SQL ORDER BY clause to sort data is:
SELECT column-list
FROM table_name [WHERE condition]
[ORDER BY column1 [, column2, .. columnN] [DESC]];
For Example: If you want to sort the employee table by salary of the employee, the sql query would be.
The SQL ORDER BY clause is used to sort the data in ascending or descending order, based on one or more
columns. Some database sorts query results in ascending order by default.
Syntax:
SELECT column-list
FROM table_name
[WHERE condition]
[ORDER BY column1, column2, .. columnN] [ASC | DESC];
You can use more than one column in the ORDER BY clause. Make sure whatever column you are using to sort,
that column should be in column-list.
Description
The ORDER BY clause orders or sorts the result of a query according to the values in one or more specific
columns. More than one columns can be ordered one within another. It depends on the user that, whether to order
them in ascending or descending order. The default order is ascending.
29
The SQL ORDER BY clause is used with the SQL SELECT statement.
Note : SQL ORDER BY clause always come at the end of a SELECT statement.
Syntax
1. SELECT <column_list> FROM < table name >.
2. WHERE <condition>
3. ORDER BY <columns> [ASC | DESC];
30