Josep Ferrer

5 Great Places to Find Free Datasets for Your Next Project

Wondering where to find free and open datasets for your next data project? Then just stop by!

Self-made image. 5 great places to find free datasets for your next project.

If you’re looking for a job in data analytics, you’ll need a portfolio to demonstrate your expertise.

Of course, if you’re new to data analytics, you probably don’t have any expertise!

But no worries!

The fact you might not have worked on a paid project yet doesn’t mean you can’t have a compelling portfolio using some practice datasets.

Fortunately, the Internet is full of datasets, most of which are completely free to download — thank god humans created the open data initiative! ;)

In this post, I share some first-rate repositories, the ones I use the most, where you can find almost any kind of dataset you want.

Any new source is welcome — so the list keeps expanding! If you have a favorite one, please let me know! :D

#1. Awesome-Public-Datasets on GitHub

This GitHub repository hosts a list of awesome public datasets! They are sorted by category and link you straight to the hosting website. The list is updated regularly, with new datasets added all the time. Stay tuned! 👀

Self-made screenshot of Awesome-Public-Datasets on GitHub.

Access: Free to access, but does include some fee-based options. Type of data: Miscellaneous. Sample Datasets:

  • The 1000 Genomes Project — The project ran between 2008 and 2015, creating the largest public catalog of human variation and genotype data.
  • Plane crash database — Plane crash data dating from 1929 to now.

#2. Kaggle

Kaggle is a worldwide community that offers aggregated datasets. Kaggle launched in 2010 with a number of machine learning competitions, which subsequently solved problems for the likes of NASA and Ford. It has since evolved into a renowned open data platform, offering cloud-based collaboration for data scientists and tonnes of great datasets covering almost any topic you can imagine. Literally, you can find anything here!

It is the dataset source I use the most! :)

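Self-made screenshot of Kaggle.

Type of data: Miscellaneous. Access: Free, but registration required. Sample Datasets:

  • Daily temperature of major cities: Daily average temperature values for the world’s major cities.
  • Glass Identification Data Set: Data set to train a model to identify different types of glass.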

#3. Google Dataset Search

It seems today we turn to Google for everything, and data is no exception. Launched in 2018, Google Dataset Search is like Google’s standard search engine, but strictly for data. It aggregates data from external sources, providing a clear summary of what’s available. It’s an excellent place to start checking around any new topic.
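Since it works like a regular search engine, a search can also be bookmarked or shared as a plain link. Here is a minimal sketch, assuming the `query` parameter seen in the Dataset Search links from this post is enough on its own; the helper name `dataset_search_url` is made up for illustration.

```python
from urllib.parse import urlencode

# Base search URL, taken from the sample links in this post.
BASE_URL = "https://datasetsearch.research.google.com/search"


def dataset_search_url(topic: str) -> str:
    """Build a Dataset Search link for a topic (hypothetical helper)."""
    # Assumption: the `query` parameter alone is enough to run a search.
    # urlencode escapes spaces and special characters in the topic.
    return f"{BASE_URL}?{urlencode({'query': topic})}"


print(dataset_search_url("birth rate"))
```

Opening the printed link in a browser should land on the results page for that topic.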

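Self-made screenshot of Google Dataset Search.

Type of data: Miscellaneous. Access: Free to search, but does include some fee-based search results. Sample Datasets:

  • Births and Birth Rates Data in the USA: This dataset includes birth rates for females by age group in the United States since 1940.
  • Global price of coffee: This dataset contains reviews of 1312 arabica and 28 robusta coffee beans from the Coffee Quality Institute.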

#4. Datahub.io

The goal of many data analysts is to help drive savvy business decisions. As such, using economic or business datasets for your portfolio project might be worth considering. Datahub covers a wide variety of topics from health to demographics. However, it has a specific focus on economic fields like stock market data, property prices, inflation, and logistics.

Because much of the data on the portal is updated monthly, or even daily, you’ll always have something fresh to work with.

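Self-made screenshot of Datahub.io.

Type of data: Mostly business and finance. Access: Mostly free, no registration required. Sample Datasets:

  • Average mass of glaciers since 1945: Average cumulative mass balance of reference glaciers worldwide from 1945 to 2014.
  • City Population Annual Timeseries by city: City population by sex, city and city type.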

#5. Data.world

Data.world provides a wide range of user-contributed datasets. It also offers a platform for companies to store and organize their data. You can find almost any kind of dataset. It also has a discovery feature that shows which datasets are the most popular at the moment. Nice to have!

Self-made screenshot of Data.world.

Type of data: Mostly business and finance. Access: Mostly free, no registration required. Sample Datasets:

  • Coronavirus Daily Data: Coronavirus new cases, deaths and total cases, updated daily by country.
  • USA Chronic Disease Indicators: Provides a cross-cutting set of 124 indicators that were developed by consensus and that allow states, territories and large metropolitan areas to uniformly define, collect, and report chronic disease data.
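Whichever of the five sources a dataset comes from, it is worth taking a quick first look before building a project on it. Here is a rough sketch, assuming the dataset was downloaded as a CSV file; the sample data, its column names, and the `summarize` helper are all made up for illustration and use only Python's standard library.

```python
import csv
import io

# Hypothetical sample standing in for a downloaded CSV file.
# The column names and values are invented for this example.
SAMPLE_CSV = """city,date,avg_temp
Barcelona,2020-01-01,11.5
Barcelona,2020-01-02,
Madrid,2020-01-01,7.0
"""


def summarize(csv_file):
    """Return (row_count, {column: number_of_empty_cells})."""
    reader = csv.DictReader(csv_file)
    missing = {name: 0 for name in reader.fieldnames}
    rows = 0
    for row in reader:
        rows += 1
        for name in reader.fieldnames:
            value = row.get(name)
            # Count absent or blank cells as missing.
            if value is None or value.strip() == "":
                missing[name] += 1
    return rows, missing


rows, missing = summarize(io.StringIO(SAMPLE_CSV))
print(f"{rows} rows")
for name, count in missing.items():
    print(f"{name}: {count} missing")
```

For a real download, the same function could be called on a file opened with `open("your_file.csv", newline="")`, where the file name is whatever you saved the dataset as.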

Hope you find this useful! If you have any further questions, do not hesitate to ask :)

Data always has a better idea — trust it.

Don’t forget to follow ForCode’Sake to get more articles like this one! ✨

You can subscribe to my Medium Newsletter to stay tuned and receive my content. I promise it will be unique!

If you are not a full Medium member yet, just check it out here to support me and many other writers. It really helps :D

You can find me on Twitter and LinkedIn as well!
