Google Dataset Search Out Of Beta
Written by Kay Ewbank   
Thursday, 30 January 2020

Google's customized search engine for 'scientists, data journalists, and data geeks' is now out of beta, and offers indexed searches for almost 25 million datasets. Dataset Search now has added filters so you can look for specific types of dataset, or only those that are free from the provider.

Dataset Search was first released in a beta version in September 2018, and aims to make it easier to search online open-access data. Until now, the problem with finding such data has been that it doesn't necessarily show up in a standard search, even though many government departments and academic institutions publish their data online. Google Dataset Search relies on institutions adding metadata tags based on an open standard (schema.org), which Dataset Search then uses to index the datasets. Dataset Search works in a similar way to Google Scholar, which can be used to search academic papers for data.
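To give an idea of what such metadata tags can look like, here is a minimal sketch in Python that builds a dataset description and wraps it for embedding in a web page. It assumes the schema.org Dataset vocabulary expressed as JSON-LD; the function names, the example.org URLs and the field values are hypothetical placeholders, and the fields a real publisher needs may differ.

```python
import json


def build_dataset_jsonld(name, description, url, is_free, downloads):
    """Build a schema.org Dataset description and return it as a JSON-LD string.

    `downloads` is a list of (encoding_format, content_url) pairs.
    The choice of fields is an illustrative assumption, not a required set.
    """
    record = {
        "@context": "https://schema.org/",
        "@type": "Dataset",
        "name": name,
        "description": description,
        "url": url,
        # Whether the dataset can be obtained without payment.
        "isAccessibleForFree": is_free,
        # One entry per downloadable file, with its media type.
        "distribution": [
            {
                "@type": "DataDownload",
                "encodingFormat": fmt,
                "contentUrl": content_url,
            }
            for fmt, content_url in downloads
        ],
    }
    return json.dumps(record, indent=2)


def as_script_tag(jsonld):
    """Wrap a JSON-LD string in the script element used to embed it in HTML."""
    return '<script type="application/ld+json">\n' + jsonld + "\n</script>"


if __name__ == "__main__":
    # All values below are made-up placeholders, not a real dataset.
    jsonld = build_dataset_jsonld(
        name="Example rainfall measurements",
        description="Placeholder description of a hypothetical dataset.",
        url="https://example.org/datasets/rainfall",
        is_free=True,
        downloads=[("text/csv", "https://example.org/datasets/rainfall.csv")],
    )
    print(as_script_tag(jsonld))
```

A publisher would place the resulting script element in the HTML of the page that describes the dataset, where a crawler can read it.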


The metadata tags are indexed by Dataset Search and combined with input from Google's Knowledge Graph, the source of the infobox that appears next to ordinary search results, to make the results more useful. Google collects and links this information, analyzes where different versions of the same dataset might be, and finds publications that may be describing or discussing the dataset.

The main improvement in the updated version is the ability to filter the results based on the type of dataset that you want, such as tables, images, or text, or on whether the dataset is available for free from the provider. If a dataset is about a geographic area, you can see the map.

The search also now works on mobile devices, and the dataset descriptions have been "significantly improved". The developers say that so far the most popular queries have included "education," "weather," "cancer," "crime," "soccer," and "dogs", so all you cat lovers out there need to up your searching.


More Information

Dataset Search

Related Articles

Google Makes Dataset Discovery Easier

Google Uses Search Data To Predict Box Office Hits

The Allen Institute's Semantic Scholar 

RankBrain - AI Comes To Google Search

Allen Institute Asks "Can You Make An AI Smarter Than An 8th Grader"

Fuzzy Logic And Uncertainty In AI

Find Prior Art Added to Google Patent Search 

 


Last Updated ( Thursday, 30 January 2020 )