Ciencia Datos Corner
MODULAR DEVELOPMENT
AIM
Develop computer applications to perform basic data processing with the Python language,
identifying methods for exploring large pools of data and relational and non-relational data
management systems (NoSQL).
LEARNING OUTCOMES
• Use of NoSQL DBs and new data models (structured and unstructured)
- Fundamentals of the NoSQL paradigm
- Data distribution and parallel processing
- Main data models in the NoSQL world: key-value, document-oriented,
property graphs, knowledge graphs
• Identification and analysis of complex problems in the area of data analysis, and
approaches to solving them
- Main concepts of data processing flows in large-volume systems
- Main phases of managing large volumes of data and associated challenges
- Data engineer roles in the main phases of data management
- Main limitations of traditional data management models
- New data models
• Effectiveness in solving complex problems by developing the knowledge needed to design prototypes of Python software
solutions in phases, without losing sight of the complexity of the overall problem.
• Identification of the tools to be applied, their cost, and the requirements of the data
cycle.
• Development of a positive attitude towards learning and continuous improvement, with the aim of getting to know and reviewing
tool vendors and their installation and update methods.
• Demonstration of initiative and autonomy in presenting prototypes and in group discussion of problems and solutions,
reviewing requirements and their costs.
AIM
Identify data management principles for a project with multiple input sources and apply data
model organization techniques from a logical and physical point of view.
LEARNING OUTCOMES
• Demonstration of critical, strategic thinking by presenting data processing schemes and
enabling discussion with stakeholders inside and outside the company to formulate future-oriented
actions.
• Development of design and data analysis activities with social responsibility, intellectual honesty and
scientific integrity.
• Awareness of the need for a responsible attitude, committed to results and mindful of limited
resources, when making decisions in complex professional environments.
AIM
Apply the fundamentals of machine learning and visualization to analyze the results of data
processing.
LEARNING OUTCOMES
• Application of machine learning techniques and integration of various data
sources
- Analysis of sentiments and polarity on the set of tweets collected.
- Construction of a profile analysis using unsupervised
clustering algorithms.
- Implementation of a polarity analysis (sentiment analysis) on the set of
collected messages.
- Implementation of two alternative approaches to compare the performance obtained:
a dictionary-based approach, and a vectorization approach (Word2Vec) combined with a supervised
machine learning model.
• Use of communication skills with stakeholders to show the most relevant aspects of the results obtained
from the process and their adaptation to the needs of the project.
• Application of innovative solutions and adaptation to changing environments.
• Capacity for continuous development of projects and communication of results and
decisions with visualization techniques and tools.
• Coordination and communication with specialists, non-specialists, supervisors and clients
with the use of communication tools for the design of relevant information on the key aspects of the application.
• The evaluation will have a theoretical-practical nature and will be carried out systematically and continuously,
during the development of each module and at the end of the course.
• It may include an initial diagnostic evaluation to detect the starting level of the
student body.
• The evaluation will be carried out using the most appropriate methods and instruments to verify the different learning outcomes
and to guarantee its reliability and validity.
• Each evaluation instrument will be accompanied by its corresponding correction and scoring system in which the
measurement criteria to evaluate the results achieved by the students are explained, clearly and unequivocally.