CV

Hello, there 👋🏾!

I'm an economist and data scientist with over 7 years of professional experience and more than 10 years working with data. I specialize in using quantitative methods to process, analyze, and visualize data for decision-making purposes.


Throughout my career, I have worked on various projects involving the processing and visualization of open data to generate economic and social indicators used for diagnosing and evaluating public policies. I have also participated in projects that required a range of skills, including developing statistical and econometric models, processing geospatial information, evaluating experimental treatments, monitoring media using web scraping, and creating web platforms and APIs, among other technology and analysis work. More details are in the next sections (below the CV PDF).

I have a passion for teaching programming, and for over 3 years, I have had the privilege of delivering courses on introductory data management, information visualization, and machine learning. Currently, I'm a data science teacher at Le Wagon.

CV_Juan_Santos english.pdf

Specialized skills

My main expertise is data analysis, visualization, and statistical modeling, and throughout the years I have gained some extra specialized skills such as:

Web Scraping

These are some of the projects that I have developed:

For these projects, I always use Python with libraries such as requests and BeautifulSoup, sometimes together with the Django framework to create web platforms.

Statistical modeling & Machine Learning

I completed the Deep Learning Diploma (in-person) offered by the CIMAT-INAOE consortium for Artificial Intelligence, as well as the Deep Learning Specialization (online) on Coursera. I have also taught machine learning models at CIDE and at Le Wagon.

Some of the projects where I applied statistical models and machine learning techniques are:

Complex survey analysis

I have worked with microdata from complex survey designs to calculate customized estimates and fit econometric models. When using these datasets, it's important to account for elements of the survey design such as the strata, the finite population correction (FPC), and the sampling weights. Some of the surveys I have worked with are:

When working with complex surveys, I usually use the Stata software or the R programming language.
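
To illustrate why the strata, FPC, and weights mentioned above matter, here is a minimal Python sketch (Python is used only for illustration; it is not code from any of my projects). The toy sample, the stratum names, and the population sizes are all hypothetical, and the variance formula assumes simple random sampling within each stratum, which is simpler than most real survey designs.

```python
from math import sqrt

# Hypothetical toy sample: (stratum, sampling weight, observed value).
sample = [
    ("urban", 100.0, 10.0),
    ("urban", 100.0, 14.0),
    ("rural", 300.0, 4.0),
    ("rural", 300.0, 8.0),
]
# Hypothetical stratum population sizes, needed for the FPC.
population = {"urban": 200, "rural": 600}


def weighted_mean(rows):
    """Design-weighted mean: each unit counts as many times as its weight."""
    total_weight = sum(w for _, w, _ in rows)
    return sum(w * y for _, w, y in rows) / total_weight


def stratified_se(rows, pop_sizes):
    """Standard error of the stratified mean, assuming simple random
    sampling within strata and applying the FPC (1 - n_h / N_h)."""
    total_pop = sum(pop_sizes.values())
    variance = 0.0
    for stratum, pop_h in pop_sizes.items():
        values = [y for s, _, y in rows if s == stratum]
        n_h = len(values)
        mean_h = sum(values) / n_h
        s2_h = sum((y - mean_h) ** 2 for y in values) / (n_h - 1)
        fpc = 1 - n_h / pop_h
        variance += (pop_h / total_pop) ** 2 * fpc * s2_h / n_h
    return sqrt(variance)


naive_mean = sum(y for _, _, y in sample) / len(sample)
design_mean = weighted_mean(sample)
design_se = stratified_se(sample, population)
```

In this toy example the unweighted mean overstates the design-weighted one because the urban stratum is over-represented in the sample relative to its population share. In practice I would rely on the survey routines in Stata or R rather than hand-written formulas.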

Geospatial information

I have experience using GIS tools such as QGIS, ArcGIS, and GeoDa to analyze and visualize geospatial data. I have also worked on projects where I used Python to automate GIS processes, saving time and increasing efficiency.

As a data scientist, I have had the opportunity to work on projects where I have made extensive use of geospatial calculations, vector operations, and map visualizations. Specifically, I have experience in tasks such as calculating distances, areas, spatial autocorrelation, clustering, and conducting vector operations such as centroids, buffers, unions, intersections, and differences, among others. I have also developed skills in map visualization techniques such as choropleths, dot density, heatmaps, animations, and interactive visualizations. Additionally, I have experience with raster operations, including georeferencing images, conducting zonal statistics, and analyzing differences over time.
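
As a small illustration of the kind of geospatial calculations listed above, the sketch below computes a great-circle distance, a polygon area, and a centroid using only the Python standard library. The function names and the Earth radius constant are choices made for this example; it assumes a spherical Earth for distances and planar coordinates for the polygon, and real projects would normally delegate this to a GIS library.

```python
from math import asin, cos, radians, sin, sqrt

EARTH_RADIUS_KM = 6371.0  # mean Earth radius; an approximation


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two (lat, lon) points in degrees."""
    phi1, phi2 = radians(lat1), radians(lat2)
    d_phi = phi2 - phi1
    d_lambda = radians(lon2 - lon1)
    a = sin(d_phi / 2) ** 2 + cos(phi1) * cos(phi2) * sin(d_lambda / 2) ** 2
    return 2 * EARTH_RADIUS_KM * asin(sqrt(a))


def polygon_area_and_centroid(coords):
    """Area and centroid of a simple planar polygon via the shoelace formula.

    `coords` is a list of (x, y) vertices in order, without repeating the
    first vertex at the end. Coordinates are assumed to be projected (planar).
    """
    twice_area = 0.0
    cx = cy = 0.0
    n = len(coords)
    for i in range(n):
        x0, y0 = coords[i]
        x1, y1 = coords[(i + 1) % n]
        cross = x0 * y1 - x1 * y0
        twice_area += cross
        cx += (x0 + x1) * cross
        cy += (y0 + y1) * cross
    signed_area = twice_area / 2
    centroid = (cx / (6 * signed_area), cy / (6 * signed_area))
    return abs(signed_area), centroid


# Hypothetical 4 x 3 rectangle in projected units.
area, centroid = polygon_area_and_centroid([(0, 0), (4, 0), (4, 3), (0, 3)])
```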


These are some of my most relevant publicly available GIS projects:

Causal inference

As an economist, I took specialized courses on causal inference, covering experimental and quasi-experimental techniques (difference-in-differences, propensity score matching, synthetic controls, panel data, and instrumental variables).
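
As a minimal illustration of the simplest of these techniques, the sketch below computes a two-group, two-period difference-in-differences estimate from group means. The outcome values are hypothetical, and the estimate is only interpretable as a causal effect under the parallel-trends assumption; a real analysis would typically use a regression with standard errors.

```python
from statistics import mean

# Hypothetical outcomes by (group, period); not data from a real study.
outcomes = {
    ("treated", "before"): [10.0, 12.0],
    ("treated", "after"): [18.0, 20.0],
    ("control", "before"): [9.0, 11.0],
    ("control", "after"): [12.0, 14.0],
}


def diff_in_diff(data):
    """Change in the treated group minus change in the control group."""
    change_treated = mean(data[("treated", "after")]) - mean(data[("treated", "before")])
    change_control = mean(data[("control", "after")]) - mean(data[("control", "before")])
    return change_treated - change_control


effect = diff_in_diff(outcomes)
```

Here the treated group rises by 8 and the control group by 3, so the estimate attributes the remaining 5 to the treatment.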

Text analytics

I have worked extensively with texts, analyzing thousands of documents in tasks like:

You can check my public repository (in Spanish) for the workshop that I presented to employees of the prosecutor's office of the State of Hidalgo:

Web development

I have used Flask, Django, and Ruby on Rails to build several web development projects:

Cloud computing

Teaching