
Pierian Data – Python for Finance & Algorithmic Trading

Course Notes

Anaconda

Jupyter Notebook system

.ipynb files

nbconvert library for file conversion

 No spaces in variable names
 Tuples (, , ,) are immutable, whereas list [, , ,] elements can be reassigned
 set() returns the unique elements of a collection
 != is the inequality check
 def my_func(): # defines a function
 append() adds an element to the end of a list
 s.lower() converts a string to lower case
 Pass inplace=True when a pandas operation should modify the object in place

Python allows you to create anonymous functions, i.e. functions having no name, using a
facility called the lambda function.

Lambda functions are small functions, usually not more than a line. They can have any
number of arguments, just like a normal function. The body of a lambda function is very
small and consists of only one expression. The result of the expression is the value when
the lambda is applied to an argument. There is also no need for a return statement in a
lambda function.
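
A minimal sketch of a named function and its lambda equivalent:

def times_two(x): # a normal named function
    return x * 2

times_two_lambda = lambda x: x * 2 # the same logic as an anonymous lambda

print(times_two(4)) # 8
print(times_two_lambda(4)) # 8
print(list(map(lambda x: x * 2, [1, 2, 3]))) # [2, 4, 6] - lambdas shine inline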

Numpy

 np.linspace returns evenly spaced (linear) numbers over an interval
 np.eye(2): identity matrix of 1s & 0s
 np.random.rand provides random numbers from a chosen distribution
 np.ones(10)*5 uses broadcasting to multiply each element by 5
 a.reshape(3,3) errors if a does not hold exactly 9 elements; note it returns a new array rather than modifying a in place
 2D slicing such as mat[:2,1:] is required to output a sub-matrix
 mat.sum(axis=0) is the sum of columns
 mat.sum(axis=1) is the sum of rows
 np.random.seed(101) fixes the random number sequence; np.random.rand(1) then draws reproducible values

Multiple large variable assignments will consume RAM, so assign variables judiciously.

 Conditional selection uses a boolean mask, e.g. arr[arr > 5] (see the sketch below)
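
A short sketch pulling these NumPy points together (the array names are illustrative):

import numpy as np

np.random.seed(101) # fix the random sequence so runs are reproducible
print(np.random.rand(1)) # same value every run after seeding

a = np.arange(9)
a = a.reshape(3, 3) # reshape returns a new array, so assign the result

mat = np.arange(1, 10).reshape(3, 3)
print(mat.sum(axis=0)) # column sums
print(mat.sum(axis=1)) # row sums
print(mat[mat > 5]) # conditional selection with a boolean mask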


Pandas

 pandas ("panel data") is an open-source library created by Wes McKinney


 Series are like arrays except they can be given a datetime index
 pandas Series can also hold functions
 axis=1 refers to columns
 df.drop() has its default axis set to zero (rows)
 Conditional statements return Boolean data frames
 Python's and operator can only process Boolean values (True/False). Use & instead
 For an or statement use the pipe operator '|'
 df.set_index() turns a column into the index
 Multi-level index calling: df.loc['G1'].loc[1] (nested indexing)
 df.loc['G2'].loc[2]['B'] (specifies a particular column B)
 df.xs(1, level='Num') returns the rows with value 1 in the index level named 'Num'
 Dealing with missing data:
 df.dropna() by default drops any rows with null values; df.dropna(axis=1) will drop any columns with null values
 df.fillna(value='fill value')
 groupby allows for aggregation based off a column
 Finding unique values in a data frame: df['colname'].unique() returns the unique values in an array
 Data I/O: csv, excel, html, sql files can be linked
 to_csv allows writing to files (to_excel('file name.xlsx', sheet_name='newsheet'))
 pandas can only import data, not macros or formulas
 DataFrame.columns returns the column names
 DataFrame['column name'].nunique() returns the number of unique objects in a certain column; .unique() returns the full array of values. (A sketch of the I/O calls follows below.)
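
A minimal sketch of the I/O and unique-value calls above; the file and column names are hypothetical:

import pandas as pd

df = pd.read_csv('example.csv') # read a CSV into a DataFrame (hypothetical file)
df.to_csv('output.csv', index=False) # write it back out without the index
df.to_excel('file name.xlsx', sheet_name='newsheet') # needs an Excel engine installed

print(df.columns) # column names
print(df['column name'].unique()) # array of unique values (hypothetical column)
print(df['column name'].nunique()) # count of unique values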

 banks.groupby("ST").count().sort_values('Bank Name',ascending=False).iloc[:5]['Bank Name']
o uses iloc to return a subset of values and the sort_values function to rank
o the .count() method counts rows per group
 banks['Acquiring Institution'].value_counts()
o the value_counts() method counts the number of times each value occurs in a specified DataFrame column
 banks[banks['Acquiring Institution']=='State Bank of Texas']
o nesting a condition inside the DataFrame brackets selectively pulls data that fulfils a criterion, such as only entries where the acquiring institution was the State Bank of Texas
 banks[banks['ST']=='CA'].groupby('City').count().sort_values('Bank Name',ascending=False).iloc[:1]
o can count within a particular subset, such as the number of banks per city within a particular state
 sum(banks['Bank Name'].apply(lambda name: 'Bank' not in name))
o counts bank names that do not contain the word 'Bank'
 sum(banks['Bank Name'].apply(lambda name: name[0].upper() == 'S'))
o finds any banks whose first letter is S (case-insensitive via .upper()). name[0] returns the first character of name
 sum(banks['Bank Name'].apply(lambda name: len(name.split())==2))
o the split() function splits a string on whatever is specified; if blank, it splits on whitespace
o the len function checks whether 2 words were returned

Matplotlib and Pandas – Visualisation

 Matplotlib has 2 API structures


o Object oriented structure & Function oriented structure
o Gallery has good examples
 Function oriented
o import matplotlib.pyplot as plt
o %matplotlib inline (for Jupyter notebooks)
o plt.plot(x,y,'b')
o plt.title('my first python chart')
o plt.xlabel('Jerrod')
o plt.ylabel('prpasfqf')
 Object oriented
o fig = plt.figure()
o axes = fig.add_axes([0.1,0.1,0.8,0.8])
o axes.plot(x,y,'b')
o axes.set_xlabel('Set X Label') # Notice the use of set_ to begin methods
o axes.set_ylabel('Set y Label')
o axes.set_title('Set Title')
 Matplotlib allows the aspect ratio, DPI and figure size to be specified when the Figure object is created, using the figsize and dpi keyword arguments:
o figsize is a tuple of the width and height of the figure in inches
o dpi is the dots-per-inch (pixels per inch)
 For example (see the sketch below):
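
A minimal sketch using the object-oriented API with figsize and dpi:

import numpy as np
import matplotlib.pyplot as plt

x = np.linspace(0, 5, 11)
y = x ** 2

fig = plt.figure(figsize=(8, 4), dpi=100) # 8x4 inches at 100 dots per inch
ax = fig.add_axes([0.1, 0.1, 0.8, 0.8])
ax.plot(x, y, 'b')
ax.set_xlabel('x')
ax.set_ylabel('y')
ax.set_title('Figure with explicit figsize and dpi')
plt.show()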

 ax2.set_xlim(20,22) & ax2.set_ylim(30,50). These commands set the x & y limits for each axis to zoom in on a section of a plot
 axes[0].plot(x,y,color="red",lw=5,ls=":") can be used to adjust the style and colour of the chart
 fig,axes=plt.subplots(1,2,figsize=(12,2)). The figsize argument lets the user set the size of the figure

Pandas Visualisation

 import numpy as np; import pandas as pd
 import matplotlib.pyplot as plt
 %matplotlib inline

There are several plot types built-in to pandas, most of them statistical plots by nature:

 df.plot.area
 df.plot.barh
 df.plot.density
 df.plot.hist
 df.plot.line
 df.plot.scatter
 df.plot.bar
 df.plot.box
 df.plot.hexbin
 df.plot.kde
 df.plot.pie

You can also just call df.plot(kind='hist') or replace that kind argument with any of the key terms
shown in the list above (e.g. 'box','barh', etc..)
 .idxmax()/.idxmin() return the index label of the max/min value
 %matplotlib notebook (makes the plot interactive)
 df3['a'].plot.hist(color="blue",bins=100,alpha=0.5)
 df3[['a','b']].plot.box()

Data Sources

 Pandas data-reader (Google’s stock API)


 Quandl (robust python API). A user key is needed for more than 50 accesses a day

Yahoo and Google have changed their APIs and are sometimes unstable. Use the codes "iex" or
"morningstar" instead

 Calls to commence pandas data reader:

import pandas_datareader.data as web
import datetime
 datetime.datetime(2015,1,1) creates the time object
 E.g. facebook = web.DataReader('FB','google',start,end)
 Returns data into a data frame
 For Options:
from pandas_datareader.data import Options
fb_options = Options('FB','google')
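
A consolidated sketch of the calls above, swapping in the 'iex' source per the earlier note (data sources change over time and may now require an API key):

import datetime
import pandas_datareader.data as web

start = datetime.datetime(2015, 1, 1)
end = datetime.datetime(2017, 1, 1)

# 'iex' used instead of 'google' per the note above; availability varies
facebook = web.DataReader('FB', 'iex', start, end)
print(facebook.head()) # the data comes back as a DataFrame indexed by date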

Quandl

 import quandl
mydata = quandl.get('EIA/PET_RWTC_D')

Pandas with Time Series Data

 DateTime index, Time Resampling, Time Shifts, Rolling & Expanding


 from datetime import datetime (Python's in-built datetime library). Defaults to 0 hrs 0 mins
 df['name'] = pd.to_datetime(df['name'])
 df.resample(rule='A').mean() will take an annual mean of the data set
 df.tshift() shifts the datetime index by a given time period frequency
 A moving average or rolling mean can be computed using the df.rolling object (df.rolling(7).mean().head(14))
 Plotting a 30-day MA on a stock price would use df.rolling(window=30).mean()['Close'].plot()
 The expanding() object returns the cumulative average

Bollinger Bands

 Volatility bands placed above and below the moving average line
 20-day means are typically used
 .std() returns the standard deviation of the data set (see the sketch below)
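
A minimal sketch of 20-day Bollinger Bands, assuming df is a DataFrame of daily prices with a 'Close' column (as in the course data):

df['Close 20d MA'] = df['Close'].rolling(20).mean() # the middle band
df['Upper Band'] = df['Close 20d MA'] + 2 * df['Close'].rolling(20).std()
df['Lower Band'] = df['Close 20d MA'] - 2 * df['Close'].rolling(20).std()
df[['Close', 'Close 20d MA', 'Upper Band', 'Lower Band']].plot(figsize=(16, 6))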

Capstone Project: Stock Market

 ford['Volume'].idxmax() returns the index value of the maximum value in the 'Volume' column
 gm['MA50'] = gm['Open'].rolling(50).mean() & gm['MA200'] = gm['Open'].rolling(200).mean() compute the moving averages
 df.index = pd.to_datetime(df.index) converts the index to a datetime object
 car_comp = pd.concat([tesla['Open'],gm['Open'],ford['Open']],axis=1)

Candlestick Chart Code

from matplotlib.finance import candlestick_ohlc

from matplotlib.dates import DateFormatter, date2num, WeekdayLocator, DayLocator, MONDAY

# Reset the index to get a column of January dates


ford_reset = ford.loc['2012-01':'2012-01'].reset_index()

# Create a new column of numerical "date" values for matplotlib to use

ford_reset['date_ax'] = ford_reset['Date'].apply(lambda date: date2num(date))

ford_values = [tuple(vals) for vals in ford_reset[['date_ax', 'Open', 'High', 'Low', 'Close']].values]

mondays = WeekdayLocator(MONDAY) # major ticks on the mondays

alldays = DayLocator() # minor ticks on the days

weekFormatter = DateFormatter('%b %d') # e.g., Jan 12

dayFormatter = DateFormatter('%d') # e.g., 12

#Plot it

fig, ax = plt.subplots()

fig.subplots_adjust(bottom=0.2)

ax.xaxis.set_major_locator(mondays)

ax.xaxis.set_minor_locator(alldays)

ax.xaxis.set_major_formatter(weekFormatter)

candlestick_ohlc(ax, ford_values, width=0.6, colorup='g',colordown='r');

 gm['returns'] = gm['Close'].pct_change(1). This method computes the percent change from one value to the next in a certain column
 gm['returns'].plot(kind="kde",label="GM"). This plots a density distribution curve
 box_df=pd.concat([tesla['returns'],ford['returns'],gm['returns']],axis=1)
box_df.columns=['Tesla Returns','Ford Returns','GM Returns']
box_df.plot(kind="box",figsize=(8,11)). This plots a box plot of the data set based on 3 columns.

Statsmodels

Statistics
 ETS Models: refer to Error Trend Seasonality models that will take each of those terms for
smoothing. It breaks up data into the following components:
o Trend
o Seasonality
o Residual

from statsmodels.tsa.seasonal import seasonal_decompose


result = seasonal_decompose(airline['Thousands of Passengers'],model='multiplicative')
result.plot()

 EWMA models (Exponentially Weighted Moving Average): reduce time lag, with more weight applied to more recent values

EWMA

Exponentially-weighted moving average

We just showed how to calculate the SMA based on some window. However, the basic SMA has some "weaknesses":

•Smaller windows will lead to more noise, rather than signal

•It will always lag by the size of the window

•It will never reach the full peak or valley of the data due to the averaging.

•Does not really inform you about possible future behaviour; all it really does is describe trends in your data.

•Extreme historical values can skew your SMA significantly

To help fix some of these issues, we can use an EWMA (Exponentially-weighted moving average).

EWMA will allow us to reduce the lag effect from SMA and will put more weight on values that
occurred more recently (by applying more weight to the more recent values, thus the name). The
amount of weight applied to the most recent values will depend on the actual parameters used in
the EWMA and the number of periods given a window size. Full details on the mathematics behind
this are in the pandas documentation. Here is the shorter version of the explanation behind EWMA.
The formula for EWMA is:

$y_t = \dfrac{\sum_{i=0}^{t} w_i x_{t-i}}{\sum_{i=0}^{t} w_i}$

where $x_t$ is the input value, $w_i$ is the applied weight (note how it can change from $i=0$ to $t$), and $y_t$ is the output.

Now the question is, how do we define the weight term $w_i$?
This depends on the adjust parameter you provide to the .ewm() method.

When adjust is True (the default), weighted averages are calculated using the weights $w_i = (1-\alpha)^i$.
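
A minimal sketch of pandas .ewm(), assuming df has a 'Close' column; span=20 makes it roughly comparable to a 20-period SMA:

df['EWMA20'] = df['Close'].ewm(span=20, adjust=True).mean() # adjust=True is the default
df[['Close', 'EWMA20']].plot(figsize=(12, 5))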

ARIMA

The general process for ARIMA models is the following:


 Visualize the Time Series Data
 Make the time series data stationary
 Plot the Correlation and Auto Correlation Charts
 Construct the ARIMA Model
 Use the model to make predictions

ARIMA is a generalisation of the ARMA model (autoregressive moving average)

o Seasonal & non-seasonal ARIMA
o Non-seasonal ARIMA takes three order terms: p, d, q
o p: autoregression (AR) order
o d: integration (differencing) order
o q: moving average (MA) order
o Stationary data has constant mean & variance over time (covariance should not be a function of time)
 There are tests for stationarity in data. A common one is the Augmented Dickey-Fuller test
 Differencing (first order: the change from one period to the next; second order: the difference of those differences; each pass sacrifices one row of data). See the sketch below
 For seasonal data, you could difference by 12 rather than 1 (the .shift(12) method does that)
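
A sketch of first, second and seasonal differencing in pandas, assuming a monthly series in df['value'] (a hypothetical column name):

df['First Difference'] = df['value'] - df['value'].shift(1) # loses one row
df['Second Difference'] = df['First Difference'] - df['First Difference'].shift(1)
df['Seasonal Difference'] = df['value'] - df['value'].shift(12) # yearly seasonality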

Stationarity Tests

Uses the Augmented Dickey Fuller Test with Statsmodels

 The null hypothesis is that the series is not stationary (i.e. a unit root is present)


 It is a unit root test
 The p-value returned dictates whether the data set is stationary or non-stationary

In statistics and econometrics, an augmented Dickey–Fuller test (ADF) tests the null hypothesis
that a unit root is present in a time series sample. The alternative hypothesis is different
depending on which version of the test is used, but is usually stationarity or trend-stationarity.
Basically, we are trying to decide whether to accept the null hypothesis H0 (that the time series
has a unit root, indicating it is non-stationary) or reject H0 and go with the alternative hypothesis
(that the time series has no unit root and is stationary).
We end up deciding this based on the p-value return.
 A small p-value (typically ≤ 0.05) indicates strong evidence against the null hypothesis,
so you reject the null hypothesis.
 A large p-value (> 0.05) indicates weak evidence against the null hypothesis, so you fail
to reject the null hypothesis.
Let's run the Augmented Dickey-Fuller test on our data:
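
A sketch of the ADF test via statsmodels, assuming the series under test is df['value'] as above:

from statsmodels.tsa.stattools import adfuller

result = adfuller(df['value'].dropna())
labels = ['ADF Test Statistic', 'p-value', '# Lags Used', '# Observations Used']
for value, label in zip(result, labels):
    print(label + ': ' + str(value))

if result[1] <= 0.05:
    print('Strong evidence against H0; reject it. The series is stationary.')
else:
    print('Weak evidence against H0; fail to reject it. The series is non-stationary.')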

Differencing

The first difference of a time series is the series of changes from one period to the next (pandas can handle this).

You can continue to take the 2nd & 3rd differences until the data reaches stationarity.

ACF & PACF

 Autocorrelation plot shows the correlation of the series with itself lagged by x time units
 These plots are usually run on the differenced/stationary data
 The ACF plots will also determine whether AR or MA terms should be used in the ARIMA model (see the sketch below)
o If the ACF plot shows positive autocorrelation at the first lag (lag-1), AR terms are suggested
o If the ACF plot shows negative autocorrelation at the first lag (lag-1), MA terms are suggested
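
A sketch of the ACF/PACF plots with statsmodels, run on the differenced series from the earlier sketch:

from statsmodels.graphics.tsaplots import plot_acf, plot_pacf

plot_acf(df['First Difference'].dropna()) # drop the NaN created by differencing
plot_pacf(df['First Difference'].dropna())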

Partial Autocorrelation

 Partial autocorrelation is a conditional correlation between 2 variables, under the assumption that we know and account for the values of some other set of variables

Finance Fundamentals

Portfolio Allocation

Sharpe Ratio

 $S = \dfrac{R_p - R_f}{\sigma_p}$
 $\sigma_p$ is the portfolio standard deviation
 If the risk-free rate = 0%, the Sharpe Ratio simplifies to mean return divided by std. deviation
 It can also be applied to annualise daily (or weekly/monthly) returns
 The K-factor is based off your sampling rate:
o K = sqrt(252) for daily data
o K = sqrt(52) for weekly data
o K = sqrt(12) for monthly data
 ASR = K*SR (annualised Sharpe Ratio; see the sketch below)
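
A sketch of the daily and annualised Sharpe Ratio, assuming df has a daily 'Close' column and a risk-free rate of 0:

import numpy as np

df['Daily Return'] = df['Close'].pct_change(1)
sr = df['Daily Return'].mean() / df['Daily Return'].std() # daily Sharpe Ratio
asr = np.sqrt(252) * sr # annualised: K = sqrt(252) for daily sampling
print(asr)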

Portfolio Optimisation

Efficient Frontier

CAPM – Capital Asset Pricing Model



Markowitz efficient portfolio optimisation

Monte Carlo Simulation

 stocks.pct_change(1).corr() returns the Pearson correlation matrix of the daily returns
 Log returns are used when normalising more advanced time series data, as they normalise & de-trend the series (see the sketch below)
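
A minimal sketch, assuming stocks is a DataFrame of daily prices as above:

import numpy as np

log_returns = np.log(stocks / stocks.shift(1)) # log of consecutive price ratios
print(log_returns.head())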

Financial Markets Knowledge

 Order Book: an electronic list of buy & sell orders for an instrument, organised by price level. It shows the number of products being bid/offered at each price point (the market depth). It also identifies the market participants, though some can remain anonymous. Order imbalances may also become apparent & point toward a certain market trend. The order book does not show the activity/batches of 'dark pools'
