Table of Contents
1. About Me
2. Curated Collections
3. Trending Articles
5. Let’s Work Together
My name is Frank Andrade. I studied engineering and specialized in Data Science. I’m currently navigating the world of freelancing by writing articles about Data Science on Medium and making videos on my YouTube channels Learn Languages with TV and Python en Español. I believe it’s a great idea to apply the knowledge you have in different fields you love. …
If you’ve ever worked with text data, you know that most of the time you have to deal with dirty data. This means that the specific text you’re looking for (names, dates, emails, etc.) needs to be extracted before it’s ready to use.
This is when regular expressions (regex) come in handy. Thanks to their syntax, you can describe the pattern you wish to extract and save an immense amount of time.
Although regular expressions might look intimidating at first sight, I created some animated images for all the regex we’re going to see in this article, so you…
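To make the idea concrete, here is a minimal sketch of pulling emails and dates out of a messy string with Python’s built-in `re` module. The sample text, the function names, and both patterns are assumptions made up for illustration, not taken from the article, and the patterns are deliberately simplified: they will not cover every valid email or date format.

```python
import re

# Hypothetical dirty text: the names, addresses, and dates are invented.
SAMPLE_TEXT = (
    "Contact Jane Doe <jane.doe@example.com> or bob@example.org!! "
    "Joined: 2021-03-15; renewed 2022-01-02."
)

# Simplified patterns (assumptions, not complete validators).
EMAIL_PATTERN = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")
DATE_PATTERN = re.compile(r"\b\d{4}-\d{2}-\d{2}\b")  # only matches YYYY-MM-DD


def extract_emails(raw_text):
    """Return every substring that looks like an email address."""
    return EMAIL_PATTERN.findall(raw_text)


def extract_dates(raw_text):
    """Return every substring shaped like YYYY-MM-DD."""
    return DATE_PATTERN.findall(raw_text)


if __name__ == "__main__":
    print(extract_emails(SAMPLE_TEXT))
    print(extract_dates(SAMPLE_TEXT))
```

Compiling the patterns once and reusing them keeps the extraction functions short. In a real project the patterns would need to be adapted to the formats that actually appear in the data.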
Web Scraping is the process of extracting data from a website. Learning Web Scraping could be as easy as following a tutorial on how libraries like Beautiful Soup or Selenium work; however, you should know some concepts to better understand what these scraping tools do and to come up with effective ways to tackle a task.
In this article, I made a list of 5 things I wish I knew when learning Web Scraping. They are either concepts you should understand before learning Web Scraping or advice to make your code more robust when scraping.
Before you start learning any Web…
As someone who has been working with Excel for many years, I know that the idea of learning things you can already do in Excel/VBA in a completely new environment like Python doesn’t sound so exciting. However, all the benefits that the Python environment offers make this worth it. For this reason, I came up with a useful and simple guide I wish I had when I switched from Excel to Python.
In this article, we’ll use Python’s pandas and NumPy libraries to replace many Excel functions you probably used in the past.
Table of Contents 1. The Dataset 2…
You’ve been learning how to code for the past few months, and you now seem to have a solid knowledge of the basics. But without any experience, no one wants to hire you, and you still don’t feel confident enough to participate in a coding competition like a hackathon to win the prize money.
At that moment, you might feel it’s almost impossible to start making money coding. I know this because I had that problem before … until I started writing programming articles on Medium.
You’re in the middle of a project and suddenly you need to make a plot to analyze the data or present the insights found. You don’t have too much time, but you definitely don’t want to create a plot that looks like this.
Recently I finished reading the book 1984 in Spanish and suddenly realized that not only have I read that book many times, but I have also read it in different languages. You can argue that reading a book in its original language will always be better than reading a translation, but rereading a book in a different language gives you a great opportunity to enjoy a book you love while learning a new language.
This led me to create a list of books I read in the 4 languages I speak: English, Spanish, Portuguese and Russian. None of those books are about…
Tokenization is a common task a data scientist comes across when working with text data. It consists of splitting an entire text into small units, also known as tokens. Most Natural Language Processing (NLP) projects have tokenization as the first step because it’s the foundation for developing good models and helps better understand the text we have.
Although tokenization in Python could be as simple as writing .split(), that method might not be the most efficient in some projects. …
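As a rough illustration of why `.split()` may fall short, the sketch below compares it with a small regex-based tokenizer. The sample sentence, the function names, and the regex are assumptions chosen for this example, not the article’s own method. The limitation shown here, punctuation staying glued to words, is only one of several possible reasons to move beyond `.split()`.

```python
import re

# Hypothetical sample sentence, invented for illustration.
SAMPLE_SENTENCE = "Tokenization isn't hard, is it?"


def whitespace_tokens(text):
    """Split on runs of whitespace; punctuation stays attached to words."""
    return text.split()


def regex_tokens(text):
    """Keep words (with internal apostrophes) and punctuation as separate tokens."""
    return re.findall(r"\w+(?:'\w+)*|[^\w\s]", text)


if __name__ == "__main__":
    print(whitespace_tokens(SAMPLE_SENTENCE))
    # ['Tokenization', "isn't", 'hard,', 'is', 'it?']
    print(regex_tokens(SAMPLE_SENTENCE))
    # ['Tokenization', "isn't", 'hard', ',', 'is', 'it', '?']
```

With `.split()`, `hard,` and `it?` each count as one token, so `hard` and `hard,` would be treated as different words in a frequency count. The regex version separates them, at the cost of a pattern that has to be tuned to the text at hand.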
Over the past years, I’ve downloaded countless language learning apps to become fluent in multiple languages. After trying dozens of apps, I realized some were vital to becoming fluent, while others were not.
For example, I found that language course apps that try to cover everything, such as Babbel, Busuu and Rosetta Stone, are not as useful as language apps that focus on improving a single language skill. …
Have you ever felt that you stopped making progress learning a language? You study and practice like you always do, but you feel you’re not improving in your language anymore. So what happened?
You’ve hit a plateau.
A plateau refers to a time when you stop making visible progress regardless of your best learning efforts. But don’t let a language learning plateau discourage you! That happens to everyone when learning a new skill. When you start learning a language, it’s easy to make quick progress, but as you learn more, you naturally slow down.
I’ve encountered many plateaus when studying…