Rodrigo Marques

Who am I?

I am an accomplished Data Scientist and Data Engineer with a passion for delivering valuable insights through analytical functions and data retrieval methods. I am currently pursuing a Masters degree in Data Science, Data Mining and Knowledge Discovery. My ultimate goal is to help companies advance by developing strategic plans based on predictive modeling and findings.

I bring a proven track record of analyzing complex data sets and serving as a strong advisor. I am curious by nature and always excited to tackle "unsolvable problems" with an out-of-the-box thinking approach. My love for learning and adapting keeps me evolving in the field of Data Science.

My skills include Data Science, Data Engineering, Business Intelligence, Data Analytics, AWS, Power BI, SQL, Python, DAX, Machine Learning, ETL, Snowflake, DBT, GIT, among others. Fluent in English and Portuguese, and conversational in Spanish and French. I am also a team player and possess strong communication and problem-solving skills. I am motivated to make a positive impact in the world through the power of data and technology

Coding & Technologies

Python

SQL

DAX

Power BI

GIT

AWS


SSMS, SSAS, SSRS

Oracle

Snowflake

DBT

Gsuite

Lean Six Sigma

Languages

English

Portuguese

Spanish

French

Professional Experience

Data Scientist

Aug 2021 - Current Pistil Data - San Francisco, CA, USA

Pistil is at the forefront of the cannabis industry, providing cutting-edge market intelligence technology to sales teams of all sizes.
Responsibilities:
* Develop and maintain high-quality code for data science projects.
* Support the customer support and business teams with technical expertise.
* Meet deadlines and deliver high-quality deliverables.
* Collaborate with cross-functional teams to integrate and productionize data science deliverables.
* Ensure scalability and performance of data science solutions in a cloud-based environment.

Tools and Technologies:
SQL, Python, Pandas, Scikit Learn, Scipy, Seaborn, SQLAlchemy, df2gspread, Plotly, Matplotlib, Git, Azure data studio, GSuite, Hubspot.

Contributions:
* Implemented advanced data analysis techniques, resulting in improved platform performance.
* Developed new features to meet the evolving needs of customers, resulting in increased user engagement and satisfaction.
* Streamlined the data science workflow by implementing automation and improving efficiency.
* Provided valuable insights and analysis to the business team, contributing to an increase in sales and revenue.

Data Engineer

Feb 2022 - Current Cloud(x) - Buenos Aires, Argentina

Cloud(x) is a leading software development firm, building AI and cloud solutions at the forefront of modern technology.
Worked as a consultant at Symend (Canada), a company that generates deep customer insights using a trifecta of behavioral science, data science, and advanced analytics to empower customers to resolve past due bills before they reach collections.

Responsibilities:
* Collaborate with cross-functional teams to ensure data resources meet requirements
* Develop and automate data pipelines using best practices
* Research and implement new technologies to improve data engineering processes
* Work closely with data scientists to support the data science life cycle
* Develop data engineering tools and libraries for testing, QA, and scaling
* Partner with analytics platform team to understand data structures and requirements
* Communicate ideas, requirements and challenges to team lead and product managers

Tools and Technologies:
* Python, SQL, Snowflake, DBT, Databricks, Spark, Azure Data Factory, Looker, ADF

Contributions:
* Developed data ingest, propagation and transformation code for Symend Canada, improving performance and user engagement
* Implemented advanced data engineering techniques to improve pipeline performance
* Developed innovative features to meet evolving customer needs
* Streamlined data engineering workflow through automation
* Provided valuable insights and analysis to the business team, resulting in increased sales and revenue
* Demonstrated strong technical expertise and a proactive approach in collaborating with crossfunctional teams

Data Intelligence Consultant

Nov 2021 - Feb 2022 wDiscover - Curitiba, PR, Brazil

As a Business Intelligence Consultant at wDISCOVER, a company that specializes in creating innovative solutions using applications and indicators to improve company's control and decision making, I had the opportunity to work on a project for Ouro Verde which consisted of migrating their legacy system to AWS.

Responsibilities:
* Assisted in the migration of Ouro Verde's legacy system to AWS, utilizing technologies such as Redshift, EMR, and Athena.
* Collaborated with a team of experts to create innovative solutions using applications and indicators to improve company's control and decision making.
* Ensured the best use and performance of BI tools such as Qlik, Tableau, and Power BI by working with a team of trained and certified professionals.

AWS technologies used:
* Redshift, EMR, Athena, Batch, Step Functions, Elastic Container Registry, S3, EC2, CloudWatch

Contributions:
* Made valuable contributions to the successful migration of Ouro Verde's legacy system to AWS, resulting in improved scalability and efficiency.
* Implemented innovative solutions that improved company's control and decision making process.
* Worked with a team of experts to ensure the best use and performance of BI tools, resulting in more effective data analysis and decision making

Data Science & Business Intelligence Analyst

Aug 2020 - Aug 2021 TKSolution | Tok&Stok - São Paulo, SP, Brazil

As a Data Science & Business Intelligence Analyst at TKSolution, a technology hub for the home decor and furniture retailer Tok & Stok, I played a vital role in driving intelligence, strategy, innovation, and valuable solutions through data analysis and visualization.

Responsabilities:
* Developed and implemented data-driven solutions to address business challenges and drive intelligence, strategy, and innovation
* Utilized a combination of ETL, data analysis, and data visualization to produce robust and efficient outputs for specific briefs or problems
* Created intelligent cubes and managerial reports using technologies such as Power BI and Reporting Services
* Worked with a variety of data sources including Oracle, Snowflake, APIs, and DBT to perform data ingestion and manipulation
* Utilized SQL, PL/SQL, Python, DAX, and M to manipulate complex data and answer specific business needs

Tools and Technologies:
Power BI, DBT, SSMS, SSAS, SSRS, Git, Oracle, Snowflake, Gsuite, Jira, Confluence, AWS (Glue, S3, Database Migration Service, Cloudwatch)

Contributions:
* Successfully translated non-data requests into analytical or reporting outputs that answered specific business questions
* Improved data management and manipulation capabilities through the use of advanced technologies such as Oracle and Snowflake
* Demonstrated the ability to manage projects in an ambiguous environment and effectively drive change and overcome obstacles and resistance.

Electrical Engineering Intern - Power Load Forecasting

Dec 2018 - Dec 2019 Energisa - Palmas, TO, Brazil

Data & Analytics centered experience as the main responsible to conduct the analysis and forecast for electrical load and power flow for the State of Tocantins.

As results of this jorney:
* Achieved a significant reduction of man-working time needed due to implemented automation and standardization of power load data;
* Improved insights generation and decision making with meaningful visualizations for the behavior of the power grid, in Power BI;
* I carried out dozens of studies to forecast the annual behavior of Tocantins electrical system up to the year 2035.

Summer Internship in Lean Six Sigma

Jun 2015 - Jul 2015 University of Tennessee & Comercial Vehicle Group - Knoxville, TN, USA

As an intern for the University of Tennessee Lean Six Sigma program, I had the opportunity to work with Comercial Vehicle Group Ltda (CVG), a leading manufacturer in the commercial vehicle industry. During my time at CVG, I was responsible for analyzing the company database and improving process flow in the testing laboratory. Through my efforts, CVG was able to achieve significant improvements in efficiency and process management.

Responsibilities:
* Analyzed and improving process flow in CVG's testing laboratory
* Identified and mitigating rework and inefficiencies
* Maintained data integrity and standardization in the database

Contribution:
* Reduced waiting time by 10.79% in the testing laboratory
* Solved problems with management processes and mitigated rework
* Optimized the micro-tasks of each process, resulting in improved efficiency and productivity.

Education

Master's degree, Data Science - Data Mining & Knowledge Discovery

2022 - 2024 Faculdad de Buenos Aires

.

MBA Business Intelligence

2020 - 2021 Faculdade Educamais

Strategic Negotiation; Market research; Tools for Decision Making; Corporate Governance; Business Intelligence; Business Analytics; Operations and Process Management; Business Intelligence Projects; Leadership and People Management; Time Management; Storytelling; Economic Scenarios.

BS Electrical Engineering

2016 - 2019 UniCatólica - TO, Brazil

Algorithms and Data Structures, Logic Circuits, Introduction to Material Sciences, Microelectronics I and II, Electrical Circuits I and II, Communication Principles, Solid Mechanics, Linear Signals and Systems, Electromagnetic Theory I and II, Transport Phenomena, Networks of Computers, Power Electronics, Data Communication Systems, Automation, Instrumentation and Control, Energy Conversion, Electromagnetic Waves, Safety Engineering, Analysis of Electrical Power Systems, Protection of Electrical Systems, Electrical Machines, Microprocessors, Power Generation Electricity, Substations, Electricity Transmission and Distribution, Building Electrical Installations, Electricity Quality.

Electrical Engineering - Sandwich Undergrad.

2016 - 2019 Eastern Washington University - WA, USA

Academic exchange program with duration of 18mo at Eastern Washington University, WA, United States.

TECH 452 – Engineering Economics; EENG 209: Circuit Theory I; EENG 330: Microelectronics I; CEB 330 Digital Foundations; EENG 250: Digital Hardware; EENG 160 – Digital circuits; Dpt of Modern Languages and Literatures: French 101

I am good at...

Responsive Design

Data Science & Analytics

DSaaS allows you to embrace data science for your business quickly. If you don't have data science capabilities in-house or, if you do, but they're over capacity, I can help you to ramp up your data science and analytics capabilities fast so you can focus on driving business results.

Photography

Business Intelligence


Either you need to design, develop and deploy enterprise processes and to integrate, support and manage the related technology applications and platforms. These services include business and infrastructure applications for BI platforms, analytics needs and data warehousing infrastructure.

Creativity

Power BI

I will work collaboratively with end users to develop reporting systems that provide accessible information for decision-making. I'll use warehouse data to solve organizational problems through reports, analysis and data visualization.

Advetising

Data Engineering

Data engineering is the practice designing and building systems for collecting, storing, and analyzing data at scale. Organizations have the ability to collect massive amounts of data, and they need the right people and technology to ensure the data is in a highly usable state by the time it reaches the data scientists and analysts.

Happy Clients

image
image
image
image
image
image
image
image



Here you can find the summary of my main projects as well as their links for more detailed exploration.


Happiness
Built a deep analysis on which factos more contribute to the perception of happiness around the world. Modeled and Optimized a model to better estimate the score the Countries would get based on each of the attributes (MAE ~ 0.38).

Tacked questions like:
*Is this GDP per capita which makes you happy?
*Is this Perception of Corruption about Goverment, which make you sad?
*Is this Freedom of Life Choises which makes you happy?
Handwritten Digit Recognition CNN Practice (MNIST Dataset)

In this notebook, I have covered the necessary steps to approach any Machine Learning Classification Problem.
Included Image Visualization for better understanding.
Quick Links to the functions I have used to explore it in depth.
Basic techniques such as Confusion Matrix, Image Augmentation, etc.
I have also compared the results of Model without using CNN and CNN.
Time Series Analysis and Forecast of Total Factor Productivity (TFP)
The measure of TFP is based on 39 variables that measure relative levels of income, output, inputs and productivity for 167 countries between 1950 and 2011. From 90's to 2000's, the TFP showed a global growth trend concentrated in manufacture sactor, and particularly in Information Technology. Over the long term, TFP growth is limited only by the ability of innovators to develop new technologies, and that a larger population makes possible a larger pool of talent to be devoted to research, and thus opens up more potential for innovation.
Sentiment Analysis for an Mobile App Reviews
Situation: The client is a financial sector app that aims to achieve greater visibility and retention of users in an organic way. As this market has grown a lot and is very competitive, he decided to invest in ASO, focusing on his efforts not only in views but also in installations, that is, not only users will see the App in the store but will also install it and use it for a long period of time. (Retention of 15-30 days).
Data Science Salary Estimator
To help data scientists better negotiate their income when they get a job.
* Created a tool that estimates data science salaries (MAE ~ $ 11K)
* Scraped over 1000 job descriptions from glassdoor using python and selenium.
* Engineered features from the text of each job description to quantify the value companies put on the knoledge about some topics: python, excel, aws, sql and spark.
* Optimized Linear, Lasso, and Random Forest Regressors using GridsearchCV to reach the best model.
* Built a client facing API using flask
Business Analysis of a Mobility Company
Discussed the main problems of a mobility company and how to solve them.
* Net profit/growth
* Cancelled orders
* Total clients
* Profit by customer
* Driver engagement

Timeline

DBT Fundamentals

Mar 2022 DBT Labs


Credential URL

Snowflake - The Complete Masterclass

Feb 2022 Udemy


Credential URL

Unified Data Analytics Essentials

Jun 2021 Databricks


Credential URL

Deploying Scalable Machine Learning for Data Science

Jan 2021 linkedIn


SQL: Data Reporting and Analysis

Dec 2020 linkedIn


Python for Data Science and Machine Learning Bootcamp

Aug 2020 Udemy


Credential URL

Power BI Data Modeling with DAX

Apr 2020 linkedIn


Business Analysis Foundations

Feb 2020 International Institute of Business Analysis + linkedIn


Introduction to Data Science: Storytelling with data

Feb 2020 International Institute of Business Analysis + linkedIn


C2 Proficient English - Standard English Test

Jan 2020 EF Standard English Test (EF SET)


Lean Enterprise Systems Program

Jul 2015 University of Tennessee, Knoxville