Softwares
Softwares
Softwares
https://www.oracle.com/in/java/technologies/javase/jdk11-archive-downloads.html
1. Open Above link and click respective Java version for your PC – (Java SE Development
kit 11.0.19)
2. while downloading – it may ask Oracle account, create your own account and
proceed download.
Python download –
https://www.python.org/downloads/release/python-3115/
1. Open the above url and download python 3.11 version ( Make your selecting proper
version of 32 or 64 depending on your machine)
2. While installing make sure you are selecting check box to add path to environment
variables, If you are not able to find checkbox not an issue we can add path
manually later
Anaconda navigator – If you don’t want to install all softwares separately and if you want
install all together in single tool you can download Anaconda navigator – I won’t
recommend this for now. We will see it later.
Pyspark setup:
# Create SparkSession
spark = SparkSession.builder \
.master("local[1]") \
.appName("SparkByExamples.com") \
.getOrCreate()
dataList = [("Java", 20000), ("Python", 100000),
("Scala", 3000)]
df=spark.createDataFrame(dataList,
schema=['Language','fee'])
df.show()