Interesting data sets

Search Browse by Topic Explore Data Available APIs Geography Guidance for Developers We all know how the vibrant gaming industry keeps us yearning for its dose of data, trends, and statistics. Contribute to awesomedata/awesome-public-datasets development by creating an account on GitHub. In the book I might use all the records (19,509 records) or I might only use the numbers that are 10 and higher (19,482 records). RS. Data from the comparison group will be used to develop a model for the new release. Data Sets. We work with data providers who seek to: Democratize access to data by making it available for analysis on AWS. Thus, it’s a fairly small data set where you can attempt any technique without worrying about your laptop’s memory being overused. NEW: We now have a machine-readable dataset discovery service available in beta release. Description: This data set was used in the KDD Cup 2004 data mining competition. makes numerous interesting data sets related to taxation and income inequality publicly available. It gives people reasons to listen to you. Our Team Terms Privacy Contact/Support Terms Privacy Contact/SupportData sets for Regression Short Course The first few data sets from the class notes are listed below. Much of the data collected by the Health Department are available for use by the public. The File Name gives the name of the file containig the data set and is often the original name of the data set as well. " Offers numerous free data sets in a searchable database. For example, to study the relationship between height and age, only these two parameters might be recorded in the data set. List of Public Data Sources Fit for Machine Learning Below is a wealth of links pointing out to free and open datasets that can be used to build predictive models. Follow us on Twitter for updates! @cooldatasets. Learn about some of the many interesting social media datasets available to you, some of which are quite new, and the different features and challenges they offer you for your next big data science project. Go to the next page of charts, Fun With Public Data Google went live with an experimental tool that allows people to explore various public data sets - from unemployment in the U. © 2019 Kaggle Inc. The recent DC Data Science meetup was July 30, 6:30 PM to 8:30 PM, at GWU, Funger Hall, Room 103, 2201 G St. Both sets of data presented YQL offers to connect to a lot of data sources as well. See the bottom of the post for a CSV verson. I suppose you can find pretty much anything on the Internet. The first few are spelled out in greater detail. They fail to realize the amount of learning they can get out from working on these projects to get a boost in their career. Government Datasets; Science Datasets; Entertainment Datasets; Machine Learning Datasets; Miscellaneous Datasets; User May 29, 2014 Looking for interesting data sets? Here's a list of more than 100 of the best stuff, from dolphin relationships to political campaign donations to A topic-centric list of HQ open datasets. Maybe there are just too many possible questions, maybe it's a pain to set up analytical tools, or maybe it's just too easy to get distracted by animal GIFs. The data has 506 rows and 14 columns. Ky Harlin BuzzFeed, Director of Data Science. UN data, a data access system to UN databasesCreating Data Mining Projects. ) Prepare to research. At the end of the week, we'll tally up all the poll responses and compare the results to all of the previous polls. This page contains downloadeble statistics data. m. This set includes information about local businesses in 10 metropolitan areas across 2 countries. This is an interesting startup and will be very interesting to follow their growth. Also, these data sets have enough open tutorials to get you going. An important note to users with version 1. But loads of people are still treating the concept of big data as Training Datasets. Other amazingly awesome lists can be found in sindresorhus's awesome list. introductory Public Data Sets This page allows you to download Educational Entity Master (EEM) data sets containing public data for the Entity Types selected. Most of the data sets listed below are free, however, some are not. Linda Boudreau. 100+ Interesting Data Sets by Robert Seaton. Nonetheless, it's fun to consider the causal relationships one could infer from these correlations. Variables of interest include farm type, farm size, inputs, technology, products as well as economic indicators. This data set includes provider data for the hip/knee complication measure, and the Agency for Healthcare Research and Quality (AHRQ) measures of serious complications. These data sets are provided for teachers of statistics to use with their students. Poverty Statistics. You already know that data is the bread and butter of reports and presentations. you will have fun working on the challenge this dataset poses. Kai Xin changed description of 100+ Interesting Data Sets for Statistics. It has been used in computational investing research. Each value is known as a datum. Luckily someone already did the hard work of scraping the requisite data. A interesting study will be to model the changes of the clusters over time. Developed by Thomas Hoffmann with Michael Russell at inTASC. This page contains sites relating to Data Sets. You need to Data obtained from A Handbook of Small Data Sets, edited by D. So he put compost in the ground floor garden plants and did not put it in the garden on the terrace. Economy Case Study Datasets and project suggestions: The most common structure is to cluster the people based on their interaction. Use this data set from Donors Choose to determine the characteristics that make the funding of projects more likely. List Price Vs. Displaying datasets 1 The links below provide websites with data sets, and websites that can help you manage and use data. Sep 13, 2016 If you've ever worked on a personal data science project, you've probably spent a lot of time browsing the internet looking for interesting data Aug 21, 2018 The first step is to find an appropriate, interesting data set. Data Science for Good events will unite the energy and talent of a diverse community to Users of this service have access to data sets, documentation, and questionnaires from NCHS surveys and data collection systems. Contribute to awesomedata/ awesome-public-datasets development by creating an account on GitHub. S. A bit macabre perhaps, but the US Fatality Analysis Reporting System data is really fun to play with. nz which The Reddit data isn't actually the top 2. gc. edu/2015/09/30/lab-launch-and-data-setsDeep and interesting datasets for computational journalists: a quick list. world Feedback Other interesting datasets include the chemical composition from wine grown in Italy, Abalone physical characteristics, and heart disease data. They don’t realize the amount of data sets available in open. One of the first things you learn in any statistics class is that correlation doesn't imply causation. Thunder Basin Antelope Study Systolic Blood Pressure Data Test Scores for General Psychology Hollywood Movies All Greens Franchise Crime Health Baseball Basketball Denver Neighborhoods Using Technology: U. Statistics project ideas for students. Most commonly a data set corresponds to the contents of a single database table, or a single statistical data matrix, where every column of the table represents a particular variable, and each row corresponds to a given member of the data set in question. It began on 19 th May 2007 and was updated on 6 th November 2007. This is not meant to be a comprehensive list of every data web site but just what I have found most useful for creating my infographics about the US economy. EDIT: Updated the link to mega. See this post for more information on how to use our datasets and contact us at info@pewresearch. Data sets and corpus / corpora for biological Some Datasets Available on the Web . Classification, Clustering . Here's a json file containing 216,930 Jeopardy questions, answers and other data. unic. The field combines tools from statistics and artificial intelligence (such as neural networks and machine learning) with database management to analyze large digital collections, known as data sets. Real . There is so much great work being done with data quality and data analytics tools. Our Team Terms Privacy Contact/Support. PR ☛☛☛. The data is freely available for anyone to use and the data can be used by any researcher without further permission by the IIF. Here are a few ideas that might make for interesting student projects at all levels (from high-school to graduate school). Data mining, also called knowledge discovery in databases, in computer science, the process of discovering interesting and useful patterns and relationships in large volumes of data. A place to find cool datasets. #source: B. The Math Forum's Internet Math Library is a comprehensive catalog of Web sites and Web pages relating to the study of mathematics. Follow Us and other data by subject Current Population Survey (CPS) Allows users to conveniently search multiple data sets Data. Browse this list of public data sets for data that you can use to prototype and test storage and analytics services and solutions. (that catalogues datasets and makes them available), here are some interesting data sets …Interesting datasets and APIs. Kai Xin copied 100+ Interesting Data Sets for Statistics from Template in list Data Board Data Science. Public-use data files are prepared and disseminated to provide access to the full scope of the data. The world may indeed be awash with data, however, it is not always easy to find a suitable data set when you need one. ) and commodity data, Data sources: National Spelling Bee and Centers for Disease Control & Prevention. This link will direct you to an external website that may have different content and privacy policies from Data. 2010 Federal STEM Education Inventory Data Set 160 recent views Office of Science and Technology Policy, Executive Office of the President — This data set provides information for STEM education (pre-kindergarten through graduate) investments funded by Federal …Statistical Science Data Sets - Large index of data sets from fully processed to raw. org. Mark Mulcahy – Waterford Technologies. A more complicated and interesting problem than it may seem at first glance! A full solution sets out the math in detail. nz which Kai Xin changed description of 100+ Interesting Data Sets for Statistics. It’s a struggle to look for reputable and legitimate sources Open Datasets. The Data Set Name is the name I gave each data set in the notes. org with any questions. You should decide how large and how messy a data set you want to work with; while Aug 15, 2018 This is a great place for Data Scientists looking for interesting datasets with some preprocessing already taken care of. When I started out doing data I tought myself how to scrape websites. You can use the listed data sets to easily test basic correctness but you can’t use them to test scaling behaviors. The world is full of interesting datasets. Homepage. cy . Spam – Non Spam An interesting problem with 1324 SMSs (Span and non-spam). Using and Handling Data . Time Series Data Library - Curated by Professor Rob Hyndman of Monash University in Australia, this is a collection of over 500 datasets containing time-series data, organized by category. interesting data setsSep 13, 2016 If you've ever worked on a personal data science project, you've probably spent a lot of time browsing the internet looking for interesting data Aug 21, 2018 The first step is to find an appropriate, interesting data set. Yelp Dataset Challenge Discover what insights lie hidden in our data. DASL provides data from a wide variety of topics so that statistics teachers can find interesting, real-world examples for their students. Discover a correlation: find new correlations. The world is full of interesting datasets. Other amazingly awesome lists can be found in sindresorhus's awesome list. This link will direct you to an external website that may have different content and privacy policies from Data. For more information and to submit an entry www. data for investigating whether whether phrasing affects Training Datasets · Interesting training data for Machine View ALL Data Sets: Wine Quality Data Set The goal is to model wine quality based on physicochemical tests So it could be interesting to test feature Stata examples and datasets from different sources around the world. I am investing at Lending Club and wanted to share this bonus offer with you. Original source was 1990 United Nations data. Each data set comes with rich metadata, including information about relevant papers, data sources, datatypes, and more. There are 50 000 training examples, describing the measurements taken in experiments where two …Pew Research Center makes its data available to the public for secondary analysis after a period of time. Beginner Level: This level comprises of data sets which are fairly easy to work with, and don’t require complex data science techniques. org with any questions. As individuals, we are lucky to have access to more data than ever before, as data sets continue to be made available online for free. Jan 21, 2016 · 10 Great Healthcare Data Sets. Warning. Round 13 has kicked off starting January 15, 2019 and will run through December 31, 2019. 9 Fascinating Datasets Available Online for Free. Toggle navigation Data Interview Q's. Content: Unlike the traditional Census of Agriculture data set, which only covers a single year, the linked file connects multiple censuses (1986, 1991, 1996, 2001, 2006 and 2011). statistics on fun random stats facts statistics interesting trivial trivia games facts fast educational stupid funny random facts about life Asia ## ## ## Africa ## ## ## Australia ## ## ## Pacific Islands ## ## ## Posted in Education. All datasets have been reviewed to conform to Yahoo's data protection standards, including strict controls on privacy. Data pairs for simple linear regression Data for multiple linear regression Data for one-way ANOVA Data for two-way ANOVA Additional information and activities using these data sets are available in the Technology Guide (ISBN: 0-618-20557-8) and Excel Guide (ISBN: 0-618-20556-X) that accompany Understandable Statistics, 7e. The Data And Story Library is brought to you by Data Description, creators of Data Desk: Data Analysis and Exploration software. U. PREV DATA SET Data Sets. Data, Research Resources, & Blogs. Share . However, data digging is a struggle. The Yahoo Webscope Program is a reference library of interesting and scientifically useful datasets for non-commercial use by academics and other scientists. io: 100+ Interesting Data Sets for Statistics; StaTrek: Leveraging open data to Oct 7, 2014 More than what you can even think of !!! The most important one of course is Datasets | Kaggle. Filename: POVERTY. If you're new to the data space, or if you've recently learned a new skill, or just trying to You can also present information creatively to make it interesting. Please google it for the other open source data set. Minhaz Kazi Last week I came across an all-too-true tweet poking fun at the ubiquity of the Iris dataset. Kai Xin changed description of 100+ Interesting Data Sets for Statistics. The key databases are listed below. Feedback is welcome. Announcing Kaggle integration with Google Data Studio. We retained just the two data sets required to make our point. This page contains downloadeble statistics data. Data Desk embodies the philosophy that “seeing is understanding”. You could Using and Handling Data . government. Data. If you find this information useful, please let us know. Fun Datasets Fri 01 May 2015. An important distinction to make if anyone's planning to do statistical analyses on the set. 6. Mode connects the world’s data and the people who analyze it. We know a good example can make a lesson on a particular statistics method vivid and relevant. Obtained from the Journal of Statistics Education online data archive. Cool Datasets. Related questions ask students to calculate averages for additional data sets. reply Where to find good data sets. 5 million posts - it's the top 1000 posts of each of the top 2500 subreddits. XLS Data for 97 countries, on birth and death rates, infant mortality rates, life expectancies, and per capita GDP. Others come from the Data and Story Library. Amazon’s AWS Public Data Sets page is an overwhelming collection of massive and free data sets. These databases, datasets, and data collections may be maintained by ARS or by ARS in cooperation with other organizations. In order of preference:The Yahoo Webscope Program is a reference library of interesting and scientifically useful datasets for non-commercial use by academics and other scientists. Rows 848. December 16, 2010. Ice cream consumption leads to murder. Now anyone can submit datasets! Submit. 30 Graphical Representations of Data Visualization techniques are ways of creating and manipulating graphical If you are comparing two sets of data, you can use a The discovery of interesting regions in spatial datasets is an important data mining task. Data makes your presentation solid. Some model types, such as time series models . It will be interesting to see the impact of these changes down the road. Inside a hut on remote Whenua Hou/Codfish Island off the coast of New Zealand’s South Island, there’s a chart on the refrigerator depicting the future of a species. our community is bringing much more variety to the ways the world can learn data science. It backs up the ideas you are selling. etc. If you want to analyze he economic activity in your area the Census's County Business Patterns database is cool. In fact, prepare to be frequently 5 Ways to Find Interesting Data Sets Editor’s note: This post was written as part of a collaboration with Enigma , a public data company. Classification (349) Regression (96) Clustering (84) Other (55) Attribute Type. I suppose you can find pretty much anything on the Internet. Author India Kerle is a data curator at Enigma. We've collected articles including whacky and useful data sets for training machine learning models, practicing an analytical language, or finding compelling insights. They are collected and tidied from blogs, answers, and user responses. Although technical in nature, data sets are far from boring. This list of a topic-centric public data sources in high quality. It’s a great list for browsing, importing into our platform, creating new models and just exploring what can be done with different sets of data. If you want to analyze he economic activity in your area the Census's County Business Patterns database is …Here are 10 great datasets on movies. Build data sets and manipulate points to discuss how variables are correlated (includes the Correlation Coefficient, the Coefficient of Determination, and linear relationships). They fail to realize the amount of Since some of the datasets include country data, and they can lead to interesting insights. Comparing Two Data Sets - Independent Practice Worksheet Complete all the problems. A place to share, find, and discuss Datasets. Our Team Terms Privacy Contact/Support Terms Privacy Contact/Support A place to find cool datasets. In fact, the need to The data set comes from the real estate industry in Boston (US). Center for Disease Control and Prevention has Data, statistics, and information on diseases provided by the U. J. discover and share cool data, connect with interesting people, a large catalog of financial data sets. Others come from various R packages. gov. The Movie data sets provide interesting data for use in a wide variety of statistics classes. If you’ve ever worked on a personal data science project, you’ve probably spent a lot of time browsing the internet looking for interesting data sets to analyze. 10 Interesting 2018 Video Game Industry Statistics, Trends Milne Library Data Collections: Open Data Sets by topic Locate and use numeric, statistical, geospatial, and qualitative data sets, find data management templates, find data repositories to house your own data and find tools for data visualization. m4. Data sets of any type: some links. I don’t claim that these datasets are easy to analyze, or that they are as complete as they purport to be. Maybe there are just too many possible questions, maybe it's a pain to set up analytical tools, or maybe it's just too easy to get distracted by animal GIFs. Information is given about the data and an indication is given of statistical techniques that may be useful when working with the data set. Data Index. Instructors of statistics & machine learning programs use movie data instead of dryer & more esoteric data sets to explain key concepts. Probability and Statistics Index. world not just because it met those needs but also because it was easy to access and the platform sets us up for future ways of working with data. The data set lists values for each of the variables, such as height and weight of an object, for each member of the data set. As the charts and maps animate over time, the changes in the world become easier to understand. 0 of the software. jenunderwood The Data And Story Library is brought to you by Data Description, creators of Data Desk: Data Analysis and Exploration software. See the scatterplots and/or create them with your class using sets of data. Infochimps is a data marketplace warehouse. Categorical (38) Numerical (306) Mixed (55 Early biomarkers of Parkinson’s disease based on natural connected speech Data Set . 5 Ways to Find Interesting Data Sets Editor’s note: This post was written as part of a collaboration with Enigma , a public data company. The World Bank, a comprehensive set of data about development in countries around the globe. edit subscriptions. Feb 12, 2016 There are hundreds (if not thousands) of free data sets available, Below is a list of 35 of the most globally interesting I've come across, but Sep 13, 2016 If you've ever worked on a personal data science project, you've probably spent a lot of time browsing the internet looking for interesting data Aug 21, 2018 The first step is to find an appropriate, interesting data set. This information is maintained and utilized by all City departments, City of Edmonton's residents, general public, government and non-government agencies and online GIS user communities. Data Sets for Quantitative Research: Home. View ALL Data Sets: Browse Through: Default Task. Data Sets for Quantitative Research Home Search this Guide Search. Perfect for getting started thanks to the various dataset sizes available. Corrections of any size and links to other collections are welcome. We can learn some surprising facts about various industries that might shock you. gov. reader; The 10 Most Bizarre Correlations. Based on the data is there any reason to believe compost improves the growth of plants? Public Data Commons hosted by Open Science Data Cloud (OSDC) – public data sets of scientific interest, including genomics data, land survey data, Project Gutenberg, Space Weather Prediction data, etc Raw data from online personality tests For general public edification the data collected through the personality tests on this website is dumped here. The Delve datasets and families are available from this page. You already know that data is the bread and butter of reports and presentations. 2018 : Somerville Happiness Survey Unless otherwise noted, our data sets are available under the Creative Commons Attribution 4. Statistical Methods & Data Sources Free Data Sources value, machine readable datasets generated by the Executive Branch of the Federal Government. Yahoo finance URL schemes. 22 May 2010 April 7, we are lucky to have access to more data than ever before, as data sets continue to be made available online for free. Subscribers: 49K Deep and interesting datasets for computational cjlab. Data sets are in various formats, zipped for download. Provided by Data Interview Questions, a mailing list for coding and data interview problems. What are some interesting data sets (with data stories if possible) for survival analysis? What are some good data sets or links of data sets for speech recognition purposes? What are some publicly available sociological and psychological data sets? Interesting datasets and APIs. venganza. 12 thoughts on “ Fun Data for teaching R ” Steven Slezak on January 22, 2016 at 07:17 said: I think this could be really interesting or your students. Here's a json file containing 216,930 Jeopardy questions, answers and other data. Categorical Data Antiseptic as Treatment for Amputation - Upper Limb (Data) Antiseptic as Treatment for Amputation - Upper Limb (Description) Interesting Data Sets A robust data set is usually the first step toward answering a question. This dataset turned out to be fairly interesting given the political aspects behind marijuana legalization. InfoChimps Data Sources for Cool Data Science Projects: Part 1 Posted by Michael Li on October 16, 2014 At The Data Incubator , we run a free eight week data science fellowship to help our Fellows land industry jobs. The Journalists Database of Databases - A good collection of interesting data, mostly government, social, and economic. Here are 10 great datasets on movies. Here are 10 great datasets on movies. Related Reports 0 Here you'll find which of our many data sets are currently available via API. If you find an interesting data source that only provides . The Legacy DBF Files will be deleted at a future date. xls files, you may use Deep and interesting datasets for computational journalists: a quick list Amazon’s AWS Public Data Sets page is an overwhelming collection of massive and free Learn about some of the many interesting social media datasets available to you, some of which are quite new, and the different features and challenges they offer you for your next big data science project. is a series of maps based on data Tutoring to Enhance Science Skills Tutoring Four: Learning to Make Line Graphs a data table with an average distance column and an average velocity column, create The data contains the over £25k expenditure for Doncaster CCG, which needs to be reported under the guidelines of NHS England. The Joyner-Boore Attenuation Data: attitude: The Chatterjee-Price Attitude Data: The AWS Public Dataset Program covers the cost of storage for publicly available high-value cloud-optimized datasets. XLS Data for 97 countries, on birth and death rates, infant mortality rates, life expectancies, and per capita GDP. Variables include year, gender, cause of death, and number of deaths. But even though data is increasingly accessible, it's sometimes hard think up an interesting problem to analyze. com/blog/five-public-datasetThe world is full of interesting datasets. Our Team Terms Privacy Contact/Support Terms Privacy Contact/Support Berkeley economics professor, makes numerous interesting data sets related to taxation and income inequality publicly available. You should decide how large and how messy a data set you want to work with; while Aug 15, 2018 This is a great place for Data Scientists looking for interesting datasets with some preprocessing already taken care of. Thunder Basin Antelope Study Systolic Blood Pressure Data Test Scores for General Psychology Hollywood Movies All Greens Franchise Crime Health The original competition asked participants to provide four separate sets of The interesting about this data is that someone might wish to filter the webpages Data Sets. Genuine data tables found on the web which seem complex or otherwise noteworthy, by Ben ‘Cerbera’ Millard. Some New Interesting Deep Learning Datasets for Data Scientists Posted on March 2, 2017 March 14, 2018 Author Muktabh Mayank Larger tagged datasets and more available computing power is what has triggered the recent AI revolution. e. Each competition provides a data set that's free for download. Causes of Death. What are some interesting data sets available out there? What are some neuroscience datasets open to the public? What are some interesting empirical findings from SocioPatterns datasets? A place to share, find, and discuss Datasets. Miscellaneous Datasets. The home of the U. Open Datasets. gov Datasets for Data Mining and Data Feb 12, 2016 There are hundreds (if not thousands) of free data sets available, Below is a list of 35 of the most globally interesting I've come across, but 2019 Kaggle Inc. We Feel Fine is one of the most interesting visualization tools I came across. To make specific requests for the release of datasets, please sign up and submit your requests on our Developer Forum. totl. They offer to host, sell, and distribute data sets. Government Datasets; Science Datasets; Entertainment Datasets; Machine Learning Datasets; Miscellaneous Datasets; User May 29, 2014 Looking for interesting data sets? Here's a list of more than 100 of the best stuff, from dolphin relationships to political campaign donations to A topic-centric list of HQ open datasets. This research is often inactive due to professional commitments. The data sets that follow are all in CSV format unless otherwise noted. world we are better able to scale our analytics solutions – handling more data at less cost than we could before. In fact, prepare to be frequently A place to share, find, and discuss Datasets. There are a number of different maps available. Unless otherwise noted, our data sets are available under the Creative Commons Attribution 4. Collections of Interesting Data Tables. Use callouts where required. Trainings in February What are the most unexpected, weird, crazy or funny open datasets available online? 100+ interest data sets it will be interesting if you live in the country The Pew Internet & American Life project has some data sets that might be fun. Also, somehow using this for a reddit bot could be interesting with the right idea. What Are Some Statistics Project Topics? Some topics for statistics projects, as suggested by McGraw-Hill Higher Education, include examining the factors that affect the gas mileage of a car, the gender distribution of a grocery store's customers, the physical factors affecting performance in sports and urban planning parameters across Correlated uncovers surprising connections between seemingly unrelated things, and you can help by answering the weekly poll. One focus of research on business forecasting has been empirical validation. Story. In fact, the need to Content: Unlike the traditional Census of Agriculture data set, which only covers a single year, the linked file connects multiple censuses (1986, 1991, 1996, 2001, 2006 and 2011). Classification . The textbook datasets for Mathematics 241 can be found here. Data sets and corpus / corpora for biological We take the most interesting part, the data, and surround it with clutter that only makes it harder to understand what the point is. Particle physics data set. Important, commonly-used datasets in high quality, easy-to-use & open form as data packages GitHub is home to over 31 million developers working together. APIs. 0 of the software. We hope that our readers will make the best use of these by gaining insights into the way The World and our governments work for the sake of the greater good. The Legacy DBF …Monthly Sunspot Data, from 1749 to "Present" sunspot. . That species is the kakapo Agricultural Research Service programs generate many publicly accessible data products that are catalogued in the Ag Data Commons. Policy The data can be used for social research and, as they are well-edited and documented, for teaching (e. Related Reports 0 Mode connects the world’s data and the people who analyze it. 1. We are just at the beginning of a revolution that will touch every business and every life on this planet. A SAS programmer posted an interesting question on a SAS discussion forum. Sign Up; thomas. The publisher of this textbook provides some data sets organized by data type/uses, such as: *data for multiple linear regression *single variable for large or samples *paired data for t-tests *data for one-way or two-way ANOVA * time series data, etc. What Are Some Statistics Project Topics? Some topics for statistics projects, as suggested by McGraw-Hill Higher Education, include examining the factors that affect the gas mileage of a car, the gender distribution of a grocery store's customers, the physical factors affecting performance in sports and urban planning parameters across Data Sets & Tables. This dataset turned out to be fairly interesting given the …r-directory > Reference Links > Free Data Sets Free Datasets. Since movies are universally understood, teaching statistics becomes easier since the domain is not that hard to understand. PM me if link dies. Stata examples and datasets from different sources around the world. Best Price for a New GMC Pickup Cricket Chirps Vs. Important, commonly-used datasets in high quality, easy-to-use & open form as data packages - Data Packaged Core Datasets. Open Datasets. Hand, et al. The M4 will extend and replicate the results of the previous three competitions using a large and diverse set of 100,000 time series. 0 International license, and the code is available under the MIT license. Be sure to check the box that says "Limit to Documents with Excel Spreadsheets". Open exchange rates. A data set (or dataset) is a collection of data. It’s a struggle to look for reputable and legitimate sources You already know that data is the bread and butter of reports and presentations. government executive branch agencies. Infochimps. With data. Home Toggle Dropdown. You can solve them using basic regression or classification algorithms. Every dataset (or family) has a brief overview page and many also have detailed documentation. Learn More; Sign In; Sign UpSome data sources are great for complementing your own data. 10 Great Healthcare Data Sets. Natural Language Processing with Python Source: John Loeffler for Interesting Data Sets (Seattle Central QELP): Data sets for real-world mathematics, examples for science. You We plan on adding more of our publicly available datasets. Data sets and problems for learning hypothesis testing I was hoping someone could suggest some good data sets and problems to work on. PREV DATA SET The Yahoo Webscope Program is a reference library of interesting and scientifically useful datasets for non-commercial use by academics and other scientists. 2. Whatever the case, we want to make it easier to start working on interesting Aug 15, 2017 · Every data is interesting as it carries some information that may be useful for someone. Home » MAA Publications » Periodicals » Loci/JOMA » Classroom-Ready Data Sets in Environmental Math - Introduction Classroom-Ready Data Sets in Environmental Math - Introduction ‹ Classroom-Ready Data Sets in Environmental Math up Classroom-Ready Data Sets in Environmental Math - The QELP Web Site › Interesting behavior of lm() with small, problematic data sets I've recently come across the following results reported from the lm() function when applied to a particular type of admittedly difficult data. The list continues- Data. Datasets for Data Mining and Data Science. NW, Washington, DC. A pirate shortage caused global warming. There wasn't as much open data back then, so I …I chose data. Join them to grow your own development teams, manage permissions, and collaborate on projects Yelp Dataset Challenge for students to conduct research or analysis on our data and share their discoveries with us. Here you'll find which of our many data sets are currently available via API. Nov 07, 2010 · The Pew Internet & American Life project has some data sets that might be fun. How to make statistics interesting, relevant and fun! and particularly focuses on step-by-step SPSS guides and data sets that interesting and fun. You can kind find image May 19, 2017 In trying to do my capstone for the coding bootcamp I'm doing, I found a number of cool data sets which I thought I should share. Other interesting datasets include the chemical composition from wine grown in Italy, Abalone physical characteristics, and heart disease data. Collections of data for developing, evaluating, and comparing learning methods. 11 websites to find free, interesting datasets. The 11 best websites to find interesting, free data sets to analyze. world we are better able to scale our analytics solutions – handling more data at less cost than we could before. We love data, big and small and we are always on the lookout for interesting datasets. Free data set for very high dimensional classification [closed] There is already a community wiki about free data sets: Locating freely available data samples. I posted a few of my favorite ones at http://www. Williams » Economics » Data, A seachable repository of over 180,000 data sets produced by U. Installation Guide Updates FAQs Documentation Register Stata Technical services . gov/Education, central guide for education data resources including high-value data sets, data visualization tools, resources for the classroom, applications created from open data and more. Mar 11, 2010 · Fun With Public Data Google went live with an experimental tool that allows people to explore various public data sets - from unemployment in the U. 2500 . and it is interesting and instructive to have in mind a variety of problems when considering learning methods. Some of their data comes at a cost but a lot of their data is free as well. Bureau of Labor Statistics. Data visualization is the end artifact, but it involves multiple steps – finding reliable data, getting the data in the right format, cleaning it up (an often Read moreData Sets. You Nov 07, 2010 · The Pew Internet & American Life project has some data sets that might be fun. C: This data set consists of computer science bibliography data. and the data is generated from somewhere in the West Coast of the USA. Stata: Data Analysis and Statistical Software Set. 3. Data Sets. All I had to do was combine and organize the CSV cesspool into one nice, neat data file. 34% 1. Finding Data. What is Data? Probability and Statistics Measures of Central Value. Whatever the case, we want to make it easier to start working on interesting Mar 21, 2018 · 5 Ways to Find Interesting Data Sets Editor’s note: This post was written as part of a collaboration with Enigma , a public data company. gov Datasets for Data Mining and Data Feb 12, 2016 There are hundreds (if not thousands) of free data sets available, Below is a list of 35 of the most globally interesting I've come across, but 2019 Kaggle Inc. 10/01/2018; 4 minutes to read; Contributors. The massive increase in the amount of data collected and stored by organizations around the world over the past few decades is undeniable and the ability to access and analyse this data is quickly becoming more and more important. Learn how to join multiple data sets together in Tableau Explore our Tableau Public Youtube playlist and Tableau's other learning resources. by Alket Cecaj on Algorithms and DataFusionSimpsons Characters Data — Images. Data obtained from A Handbook of Small Data Sets, edited by D. In the course of a typical day, I have a lot of information come my way. Subject: I think you should check out Lending Club; it's worth your time has invited you to join Lending Club. Big data is a hot issue in today’s business world. Tableau Public Data Sets for DC Data Science. This is a regression problem. Interesting Data Sets November 6, 2017 November 6, 2017 ~ BaumanS Here is a list of links to data sets and how to find data sets with Fitbit and other applications: CDC WONDER is a system for disseminating Public Health data and information. Delve Datasets Collections of data for developing, evaluating, and comparing learning methods. A great starting point if you need large (US-specific) data sets from 2000-2011 on any of these indicators. Please feel free to add any I may have missed out. Public data sets for testing and prototyping. 84% to 6. Slope on Beach National Unemployment Male Vs. Jan 16, 2015 · Public Data Commons hosted by Open Science Data Cloud (OSDC) – public data sets of scientific interest, including genomics data, land survey data, Project Gutenberg, Space Weather Prediction data, etc© 2019 Kaggle Inc. xls files, you may use Excel to export the data to a . when your chart has a lot of elements. Interesting data points: If you use quarter 2 data for testing, you will notice something interesting in the week ending 5/27/2011 every Users of this service have access to data sets, documentation, and questionnaires from NCHS surveys and data collection systems. Finding an interesting data set and a story it tells can be the most difficult part of producing an infographic or data visualization. The Reddit data isn't actually the top 2. Different modules of EpiQuery, an interactive tool, provide quick answers about specific health indicators broken down by demographics, neighborhoods and in relation to other health indicators. Public Data sets on Amazon AWS Amazon provides following data sets : ENSEMBL Annotated Gnome data, US Census data, UniGene, Freebase dump Data transfer is 'free' within Amazon eco system (within the same zone) AWS data sets. Some cool data sets I have found online I am collecting a list of the most interesting economic data I have found on the web. This is a directory of government, federal, state, city, local and other public datasets. Develop new cloud-native techniques, formats, and tools that lower the cost of working with data. 25+ websites to find datasets for data science projects. Presentation of data with simple animation makes data powerful. The dataset 50 Great Examples of Data Visualization . Subject: I think you should check out Lending Club; it's worth your time has invited you to join Lending Club. icio. These resources will help you learn how to parse through large sets of text data to find the meaning in all the noise. The data sets were collected over various periods of time, depending on the size of the set. of free data sets available, ready to be used and analyzed by anyone willing to look for them. Sep 30, 2015 · Deep and interesting datasets for computational journalists: a quick list. Multivariate, Text, Domain-Theory . All of these ideas depend on free or scraped data, which means that anyone can work on them. Perhaps the best freely available data set of games is known as the “Million Base,” boasting some 2. ufo_sightings. Draws from U. net "No data with a good schema needs to be justified. all; In this article. Probability and Statistics IndexShrinkage Results for Several Laundry Drying Factors Data Description Variation in Cotton Roving by Day, Spindle, and Position Data Description Hairiness of Fabric of Various Twist Levels, Test Speeds, and Bobbins Data DescriptionSustainability data set - supply chain or company practices? 0 . Others are interesting or just fun to play with. An investment at Lending Club can provide solid returns Finding an interesting data set and a story it tells can be the most difficult part of producing an infographic or data visualization. 10 Great Healthcare Data Sets. Census Bureau data and many other sources to provide comprehensive, impartial analyses on a wide range of demographic-related issues. io: 100+ Interesting Data Sets for Statistics; StaTrek: Leveraging open data to Oct 7, 2014 More than what you can even think of !!! The most important one of course is Datasets | Kaggle. Recently, I’ve seen several sets of data that I thought you’d find interesting. Go Stata. co. Conclusion. Fun Facts and Statistics More than 10 people a year are killed by a vending machine. Apart from the UCI repository, you may find other ‘interesting’ datasets here * …Project datasets 🏠 🏫 Since some of the datasets include country data, They are more interesting in that fewer (or no) visualizations are available online yet, and they can lead to interesting insights. Skip to content. Mode connects the world’s data and the people who analyze it. Mar 17, 2016 · hi Rahul, In public website try using domain name and get data set as well as workbooks. I’d welcome ideas/suggestions/additions to the list as well. It’s a struggle to look for reputable and legitimate sources Data Sets Quebec Birth data: Number of births, daily, Quebec, January 1, 1977 to December 31, 1990. g. Interesting Data Sets. ac. S. Data: Government, State, City, Local and Public. csv file only. This data set includes provider data for the hip/knee complication measure, and the Interesting Datasets. In this month's set of hand-picked datasets of the week, Rdatasets is a collection of over 1200 datasets that were originally distributed alongside the statistical software environment R and some of its add-on packages. Data visualization is the end artifact, but it involves multiple steps – finding reliable data, getting the data in the right format, cleaning it up (an often Read moreFinding an interesting data set and a story it tells can be the most difficult part of producing an infographic or data visualization. Downloading instructions are available in “readme” files. The Google Public Data Explorer makes large datasets easy to explore, visualize and communicate. Join them to grow your own development teams, manage permissions, and collaborate on projects. Primarily as an excuse to let you know about the amazing Infochimps website (that catalogues datasets and makes them available), here are some interesting data sets that you might want to explore: I chose data. All data is anonymous. CDC WONDER is a system for disseminating Public Health data and information. 5. Bureau of Labor Statistics (BLS) - Subject Areas Includes current data on: Inflation & Prices, Employment, Unemployment, Pay & Benefits, Spending & Time Use, Productivity, Workplace Injuries, and more. Every single business and organization collects data to answer questions and capture evidence. Lending Club Notes with grades A through E have provided Historical Returns of 4. An important distinction to make if anyone's planning to do statistical analyses on the set. World Inequality Database (formerly Top Incomes Database) . ET Tweet Share Copy 1. A more complicated and interesting problem than it may seem at first glance! A full solution sets out the math in detail. Synthetic data sets are not interesting but neither are they random or unrealistic if built by a competent designer. and you could do some Data sets and templates related to Benford's Law: Chapter 1 to 7: The census data set that is used for several applications. LexisNexis Statistical Universe - Just about everything. The Time Series Data Library (TSDL) was created by Rob Hyndman, Professor of Statistics at Monash University, Australia. Causes of death in France from 2001-2008. It will be interesting to see the impact of Data Sets & Tables. world not just because it met those needs but also because it was easy to access and the platform sets us up for future ways of working with data. The Data Services unit at MU Libraries offers reference assistance to MU faculty, staff and Some cool data sets I have found online I am collecting a list of the most interesting economic data I have found on the web. Stata: Data Analysis and Statistical Software The data can be used for social research and Some Datasets Available on the Web . Aaron Dietz it takes the same two ingredients to get someone started on their data journey: a data set that appeals to the person’s interests, and questions that the data can help answer. This is a very effective way to get your point across in less than 2 seconds, when your chart has a lot of elements. It provides Comparing Two Data Sets - Independent Practice Worksheet Complete all the problems. Everyone A place to find cool datasets. John wants to know if compost improves the growth of plants. badpsychologyblog. All datasets have been reviewed to conform to Yahoo's data protection standards, including strict controls on privacy. Project Cerbera > Web Technology > Study > 2007 > Collections of Interesting Data Tables. But even though data is increasingly accessible, it's sometimes hard think up an interesting problem to analyze. If you want to analyze he economic activity in your area the Census's County Business Patterns database is …The Reddit data isn't actually the top 2. The data set may comprise data for one or more members, corresponding to the number of rows. Statistical Science Data Sets - Large index of data sets from fully processed to raw. Some of these datasets are original and were developed for statistics classes at Calvin College. The data set comes from the real estate industry in Boston (US). We recommend this as a group exercise for upper level students. The City of Edmonton maintains a wide range of data, API's, and interactive charts and maps. Socrata is another interesting As a consequence, today, students of the game benefit from one of the richest data sets of any game or sport. Dow Jones Index Data Set Download: Data Folder, Data Set Description. ca – Data Sets I found that are interesting, and some suggestions 1 Reply Yesterday was the one year anniversary of the Canadian federal government’s open data portal . In this list, we have also provided tutorials to help you get started. The dataset Nonetheless, it's fun to consider the causal relationships one could infer from these correlations. In particular, we are interested in identifying disjoint, contiguous regions that are unusual with respect to the distribution of a given class; i. Posted by Patnab on January 21, 2016 at 12:30pm; View Blog; There is so much great work being done with data quality and data analytics tools. A Challenge to Analyze the World’s Most Interesting Data: The Department of Commerce Publishes its Datasets on Kaggle. year: Yearly Sunspot Data, 1700-1988: sunspots: Monthly Sunspot Numbers, 1749-1983: swiss: Swiss Fertility and Socioeconomic Indicators (1888) DataSee data mining examples, including examples of data mining algorithms and simple datasets, that will help you learn how data mining works and how companies can make data-related decisions based on set rules. By WDD Staff | Jun. Obtained from the Journal of Statistics Education online data archive 2010 Federal STEM Education Inventory Data Set 160 recent views Office of Science and Technology Policy, Executive Office of the President — This data set provides information for STEM education (pre-kindergarten through graduate) investments funded by Federal agencies at the level of $300,000 or above. Big data is not a fad. Sustainability data set - supply chain or company practices? 0 . A data base of long-term historical data on the shares of national incomes going to the top of the income distribution in each of a large number of different The 11 best websites to find interesting, free data sets to analyze. 2011 The Math Forum's Internet Math Library is a comprehensive catalog of Web sites and Web pages relating to the study of mathematics. In this month's set of hand-picked datasets of the week, The AWS Public Dataset Program covers the cost of storage for publicly available high-value cloud-optimized datasets. If you are looking for public data sets to build demos or participate in the Power BI Report Contest. Jan 16, 2015 · Public Data Commons hosted by Open Science Data Cloud (OSDC) – public data sets of scientific interest, including genomics data, land survey data, Project Gutenberg, Space Weather Prediction data, etcData sets for Regression Short Course The first few data sets from the class notes are listed below. FBI Crime Data: The FBI crime data is fascinating and one of the most interesting data sets on this list. Admin 2012/02/29. Yahoo! Webscope - A reference library of interesting and scientifically useful datasets for non-commercial use by academics and other scientists. 01, 2009 Rootzmap—Mapping the Internet is a series of maps based on data sets provided by NASA and created by Philippe Bourcier. If you have your own list of favorite data sources and want to share them, All datasets are well documented, including data set descriptions. Access Washington has Washington State government information and services. The training data is from high-energy collision experiments. Here you will find data, tools, and resources to conduct research, develop web and mobile applications, design data visualizations, and more. The programmer wanted to iterate over hundreds of SAS data sets, read in all the character variables, and then do some analysis. " - Dr. This is a really interesting dataset for Neural Network Style-Transfer Algorithms. As the number of people becoming involved with R and data science increases so does the need for interesting data sets for creating examples, showcasing machine learning algorithms and developing statistical analyses. Pew Research Center makes its data available to the public for secondary analysis after a period of time. my subreddits. Multivariate . 0. These algorithms can be tricky to build, but it would be a very interesting project to try and map real human faces into the style of The Simpsons characters. Subscribers: 49K Five Public Datasets, and Lots of Ideas for Exploring Themhttps://mode. GitHub is home to over 31 million developers working together. J. Numbers. This channelizes the audience attention to where it matters. Datasets and project suggestions: Below are descriptions of several data sets, and some suggested projects. to education statistics in California. That's why I find the Computerworld Data+ Editors' Choice awards so interesting -- they're examples of data use in the wild that are actually helping real businesses, government agencies and other Large data sets exist but they are often implausibly large to move around over the Internet. us. 50 Great Examples of Data Visualization that you’re learning about in a more interesting, and often more useful manner. Interesting Data Sets * If, tomorrow, you get an email congratulating you on your new status as future Jeopardy contestant, how are you going to prepare? Well, one approach might be to download this archive of 216,930 past Jeopardy questions and plug them into your favorite spaced repetition system . interesting data sets jump to content. The format of the json object is described below. Multiple data sets covering human-computer interaction, audio/video, data mining/information retrieval, geospatial/location, natural language processing, and robotics/computer vision. Everyone A place to find cool datasets. Variables van Langren's Data on Longitude Distance between Toledo and Rome 12 9 FALSE TRUE TRUE FALSE CSV : DOC : HistData Macdonell Macdonell's Data on Height and Finger Length of Criminals, used by Gosset (1908) 924 3 FALSE FALSE TRUE FALSE CSV : DOC : HistData MacdonellDF Macdonell's Data on Height and Finger Length of Criminals, used by Gosset (1908 Stack Exchange network consists of 175 Q&A communities including Stack Overflow, the largest, most trusted online community for developers to learn, share their knowledge, and build their careers. I didn't just do it because it was fun to learn how to scrape; it was my only source of interesting data that I could use to practice my analysis skills. Air quality measurements on 41 U. 150 variants to simulate retrofitting techniques. Temperature Diameter of Sand Granules Vs. It would be interesting to analyze the survey data on tuvalabs. C. The rest is dimmed out. Genuine data tables found on the web which seem complex or otherwise noteworthy. See data mining examples, including examples of data mining algorithms and simple datasets, that will help you learn how data mining works and how companies can make data-related decisions based on set rules. Data, Research Resources, & Blogs. Big Data: 33 Brilliant And Free Data Sources Anyone Can Use. Kids + Data: 2 Activities to Introduce the Fun World of Data . DataFerrett , a data mining tool that accesses and manipulates TheDataWeb, a collection of many on-line US Goverment datasets. 25+ websites to find datasets for data science projects. Over the last two years, the BigML team has compiled a long list of sources of data that anyone can use. Feedback. to provide data from a wide variety of topics so that statistics teachers can find real-world examples that will be interesting to Data Sets & Tables. stanford. 5 million posts - it's the top 1000 posts of each of the top 2500 subreddits. The goal is to make these data more broadly accessible for teaching and statistical software development. Fun Facts and Statistics More than 10 people a year are killed by a vending machine. io: 100+ Interesting Data Sets for Statistics; StaTrek: Leveraging open data to 2019 Kaggle Inc. Public Data Sets This page allows you to download Educational Entity Master (EEM) data sets containing public data for the Entity Types selected. The R Datasets Package Documentation for package ‘datasets’ version 3. I've been tagging interesting data I find on the web in del. data. Before using these data sets, please review their README files for the usage licenses and other details. 96 genuine tables. One of the datasets you can find here is the widely used ‘iris’ dataset. cities. In SQL Server Data Tools (SSDT), you build data mining projects using the template, You also have the option to separate your data into a training data set, used for building models, and a holdout data set to use in testing or validating your mining models. 10000 . Data sets are used to analyze everything from climate change to clean energy statistics. This list of a topic-centric public data sources in high quality. Statistical data sets may record as much information as is required by the experiment. com. Data visualization is the end artifact, but it involves multiple steps – finding reliable data, getting the data in the right format, cleaning it up (an often by Joseph Rickert. Based on the data is there any reason to believe compost improves the growth of plants?The M4 will extend and replicate the results of the previous three competitions using a large and diverse set of 100,000 time series. Abstract: This dataset contains weekly data for the Dow Jones Industrial Index. Rdatasets is a collection of over 1200 datasets that were originally distributed alongside the statistical software environment R and some of its add-on packages. Survey posts must contain a URL to the results data which is MovieLens - Rating data sets from the MovieLens web site. Marvin Fenderson K. a region that contains an unusually low or high number of Data Sets. Government’s open data. Here is a post collecting more that 30 links on datasets available online for free. Graphs Index. Posted on April 11, 2013, at 8:56 a. Here are some more cool public data sources you can use for your next project: Data With a Cause: Environmental Data: Data on household energy usage is available as well as NASA Climate Data. Quenneville, Statistics Canada Grocery Shopping Data: The amount that 50 shoppers spent at a grocery store. There are a number of business microdatabases that can be accessed at CDER. Data obtained from A Handbook of Small Data Sets, edited by D. 2010 Federal STEM Education Inventory Data Set 160 recent views Office of Science and Technology Policy, Executive Office of the President — This data set provides information for STEM education (pre-kindergarten through graduate) investments funded by Federal …Yelp Dataset Challenge Discover what insights lie hidden in our data. thomas Updated over 4 years ago. examples of topics we find interesting One of the biggest obstacles to successful projects has been getting access to interesting data. You can kind find image May 19, 2017 In trying to do my capstone for the coding bootcamp I'm doing, I found a number of cool data sets which I thought I should share. If you’re interested in analyzing time series data, you can use it to chart changes in crime rates at the national level over a 20-year period . diy / education / projects / R. You are encouraged to select and flesh out one of these projects, or make up you own well-specified project using these datasets. 2 million matches