Posts by Category: Statistics Resources and Big Data

ReDash – Make Your Company Data Driven

September 19, 2017

ReDash – Make Your Company Data Driven
https://redash.io/

Redash is an open source tool for teams to query, visualize and collaborate. Redash is quick to set up and works with any data source you might need, so you can query from anywhere in no time. Share your results and dashboards with other team members and empower your entire organization to be data driven with no-code filters and parameters that instantly adjust. Get alerts for pre-defined triggers to your email, Slack or Hipchat (you can set up a custom webhook as well). Redash is our take on freeing the data within our company in a way that will better fit our culture and usage patterns. We tried to use traditional BI suites and discovered a set of bloated, technically challenged and slow tools/flows. What we were looking for was a more hacker’ish way to look at data, so we built one. Redash was built to allow fast and easy access to billions of records that we process and collect using Amazon Redshift (“petabyte scale data warehouse” that “speaks” PostgreSQL). Today Redash has support for querying multiple databases, including: Redshift, Google BigQuery, Google Spreadsheets, PostgreSQL, MySQL, Graphite, Axibase Time Series Database and custom scripts. Main features include: 1) Query editor – enjoy all the latest standards like auto-complete and snippets. Share both your results and queries to support an open and data driven approach within the organization; 2) Visualization – once you have your dataset, select one of our 9 types of visualizations for your query. You can also export or embed it anywhere; 3) Dashboard – combine several visualizations into a topic-targeted dashboard; 4) Alerts – get notified via email, Slack, Hipchat or a webhook when your query’s results need attention; and 5) API – anything you can do with the UI, you can do with the API. Easily connect results to other systems or automate your workflows. This will be added to Entrepreneurial Resources Subject Tracer™. This will be added to the tools section of Research Resources Subject Tracer™. This will be added to Statistics Resources and Big Data Subject Tracer™.
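
As a rough illustration of the API feature described above, here is a minimal Python sketch for pulling the cached results of a saved query into another system. It assumes a Redash instance at a placeholder address (redash.example.com), a hypothetical query id and API key, and the results endpoint and JSON layout as commonly documented for Redash; check these against your own installation before relying on them.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen


def results_url(base_url, query_id, api_key, fmt="json"):
    """Build the URL for the cached results of a saved query.

    Assumption: the instance exposes /api/queries/<id>/results.<fmt>
    and accepts the key as an api_key query parameter.
    """
    if fmt not in ("json", "csv"):
        raise ValueError("fmt must be 'json' or 'csv'")
    return "{}/api/queries/{}/results.{}?{}".format(
        base_url.rstrip("/"), query_id, fmt, urlencode({"api_key": api_key})
    )


def fetch_rows(base_url, query_id, api_key):
    """Download the JSON results and return the list of row dictionaries.

    Assumption: rows live under query_result -> data -> rows.
    """
    with urlopen(results_url(base_url, query_id, api_key)) as resp:
        payload = json.load(resp)
    return payload["query_result"]["data"]["rows"]


if __name__ == "__main__":
    # Placeholder values; replace with a real instance, query id and key.
    print(results_url("https://redash.example.com", 42, "YOUR_API_KEY"))
```

Calling fetch_rows with real values would return the same rows shown in the query's results table, which can then be passed to a report, a script or another service.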

Updated> Statistics Resources and Big Data 2018 White Paper Dataset Link Compilation

September 18, 2017

Updated> Statistics Resources and Big Data 2018 White Paper Dataset Link Compilation
http://www.StatisticsResources.com/

I have just updated my white paper dataset link compilation for Statistics Resources and Big Data 2018 Subject Tracer™ by Marcus P. Zillman, M.S., A.M.H.A. It is now a 34-page .pdf document (278KB), completely updated with all links validated and new URLs added on September 11, 2017. Other white papers are also available.

This research is powered by Subject Tracer Bots™ from the Virtual Private Library™. Isn't yours?

Trifacta – Data Wrangling

August 18, 2017

Trifacta – Data Wrangling
https://www.trifacta.com/

Trifacta’s mission is to create radical productivity for people who analyze data. They are deeply focused on solving for the biggest bottleneck in the data lifecycle, data wrangling, by making it more intuitive and efficient for anyone who works with data. This will be added to Statistics Resources and Big Data Subject Tracer™. This will be added to Entrepreneurial Resources Subject Tracer™.

Kaggle – Home of Data Science and Machine Learning

August 07, 2017

Kaggle – Home of Data Science and Machine Learning
https://www.kaggle.com/

Kaggle helps you learn, work and play. Features include: a) Competitions – Climb the world’s most elite machine learning leaderboards, b) Datasets – Explore and analyze a collection of high-quality public datasets, and c) Kernels – Run code in the cloud and receive community feedback on your work. This will be added to Artificial Intelligence Resources Subject Tracer™. This will be added to Script Resources Subject Tracer™. This will be added to the tools section of Research Resources Subject Tracer™. This will be added to Statistics Resources and Big Data Subject Tracer™.

The Magazine of Early American Datasets

July 24, 2017

The Magazine of Early American Datasets
http://repository.upenn.edu/mead

The University of Pennsylvania Libraries offers the Magazine of Early American Datasets (MEAD), a collection of datasets for researchers of early American history. The datasets are collected from organizations such as the American Antiquarian Society as well as from individual scholars. Visitors can download datasets in whatever format their original authors used or as comma-separated values (.csv). Each entry also includes a codebook, allowing researchers to use this data with ease. One highlight of this collection is two early nineteenth century admissions books from the Eastern State Penitentiary, transcribed by Scott Ziegler of the American Philosophical Society and Michelle Ziogas of Drexel University. This dataset includes intake notes on each incarcerated individual. For example, the notes on a Philadelphia blacksmith charged with burglary read: “Seems like an old convict & very insensible. No wish to intercourse with me on religious subjects.” Other datasets include the 19th Century American Children’s Book Trade Directory (courtesy of the American Antiquarian Society); a collection of George Washington’s shipping invoices; and the 1790 Census of black individuals living in Philadelphia. This will be added to the tools section of Research Resources Subject Tracer™. This will be added to Statistics Resources and Big Data Subject Tracer™. This will be added to Reference Resources Subject Tracer™. Copyright © 2017 Internet Scout Research Group – http://scout.wisc.edu

DataPortals.org

July 24, 2017

DataPortals.org
http://www.DataPortals.org/

DataPortals.org is the most comprehensive list of open data portals in the world. It is curated by a group of leading open data experts from around the world – including representatives from local, regional and national governments, international organizations such as the World Bank, and numerous NGOs. This will be added to Open Datasets Subject Tracer™. This will be added to Statistics Resources and Big Data Subject Tracer™. This will be added to Entrepreneurial Resources Subject Tracer™. This will be added to Business Intelligence Resources Subject Tracer™. This will be added to Deep Web Research and Discovery Resources.

Enigma Public – World’s Broadest Collection of Public Data

July 17, 2017

Enigma Public – World’s Broadest Collection of Public Data
https://public.enigma.com/

Enigma is an operational data management and intelligence company. They believe in curiosity and the power of discovery. Their mission is to empower people to interpret and improve the world around them. To deliver on that ambitious goal, they place data into the context of the real world and make it connected, open, and actionable. Their repository of public data informs and trains each of their enterprise offerings. Enigma Public is the world’s broadest collection of public data. Take a tour to see everything you can do in the Public platform. This will be added to Open Datasets Subject Tracer™. This will be added to Statistics Resources and Big Data Subject Tracer™. This will be added to Entrepreneurial Resources Subject Tracer™. This will be added to Business Intelligence Resources Subject Tracer™.

US Government Web Services and XML Data Sources

June 13, 2017

US Government Web Services and XML Data Sources
http://usgovxml.com/

USGovXML.com is an index to publicly available web services and XML data sources that are provided by the US government. USGovXML.com indexes data sources from all three branches of government as well as its boards, commissions, corporations and independent agencies. This will be added to Open Datasets Subject Tracer™. This will be added to Deep Web Research and Discovery Resources 2017. This will be added to Entrepreneurial Resources Subject Tracer™. This will be added to Statistics Resources and Big Data Subject Tracer™. This will be added to New Economy Resources 2017.

Scientific Data Repository – Real Time Visualization and Exploration Techniques

June 13, 2017

Scientific Data Repository – Real Time Visualization and Exploration Techniques
http://www.mlvis.com/platform.php

This project is the first to combine the notion of a data repository with real-time visual analytics for interactive data mining and exploratory analysis on the web. State-of-the-art statistical techniques are combined with real-time data visualization, giving researchers the ability to seamlessly find, explore, understand, and discover key insights in a large number of publicly donated data sets. This large, comprehensive collection of data is useful for making significant research findings and as benchmark data sets for a wide variety of applications and domains. It includes relational, attributed, heterogeneous, streaming, spatial, and time series data as well as non-relational machine learning data. All data sets are easily downloaded in a standard, consistent format. We have also built a multi-level interactive visual analytics engine that allows users to visualize and interactively explore the data in a free-flowing manner. This will be added to Open Datasets Subject Tracer™. This will be added to Deep Web Research and Discovery Resources 2017. This will be added to Entrepreneurial Resources Subject Tracer™. This will be added to Statistics Resources and Big Data Subject Tracer™. This will be added to Online Research Browsers 2017.

CDC: 500 Cities Project

June 12, 2017

CDC: 500 Cities Project
https://www.cdc.gov/500Cities

From the Centers for Disease Control and Prevention (CDC), in collaboration with the Robert Wood Johnson Foundation and the CDC Foundation, comes the 500 Cities Project, an interactive map that allows researchers to quickly access and analyze health data from the 500 largest cities in the United States. Visitors can browse this information, which is drawn from census and city data, in a number of ways. Researchers interested in learning more about data by region can select an individual state on the map. From there, select from three categories (Health Outcomes; Prevention; Unhealthy Behaviors) to view a variety of specific data measurements for cities in that state. Measurements include rates of high blood pressure, smoking, and health insurance coverage, to name just a few. Alternatively, visitors may choose to view all data in the United States for a particular measurement at once (e.g. prevalence of arthritis). Data is color coded, allowing visitors to compare cities at first glance, or visitors may hover their cursor over any individual city to view available data. In addition to this map feature, researchers can also request a comparison report for two cities, download a PDF copy of 28 data maps, or download all available data. This will be added to Healthcare Resources Subject Tracer™. This will be added to Statistics Resources and Big Data Subject Tracer™. Copyright © 2017 Internet Scout Research Group – http://scout.wisc.edu
