Posts by Category: Data Mining Resources

Text Mining

June 27, 2017

Text Mining
http://www.istl.org/17-spring/internet.html

Text Mining from Science and Technology Resources on the Internet by Kristen Cooper. Taken from the overview: “As defined by Bernard Reilly (2012), president of the Center for Research Libraries, text mining is “the automated processing of large amounts of digital data or textual content for the purpose of information retrieval, extraction, interpretation, and analysis.” The first step is to find or build a corpus, or the collection of text that a researcher wishes to work with. Most often researchers will need to download this corpus to either their computers or an alternative storage platform. Once this has been done, different tools can be used to find patterns, biases, and other trends that are present in the text (Reilly 2012). Within higher education, text mining is most often found among the digital humanities and linguistics studies. However it is growing in popularity in the science and technology fields. It is possible to find many examples of how text mining is beginning to be utilized in the sciences. It allows users to search across a large set of documents to find connections that would be prohibitively expensive in terms of time to attempt to read individually. An example of this can be seen in the biomedical sciences where Frijters et al. (2010) used text mining to search in MedLine for drugs that could interfere with cell proliferation. Another example can be found with the works of the EXFOR library, which contains experimental nuclear reaction data. Hirdt and Brown (2016) used text mining to build a graph of the relationships between the reactions in the library. They were then able to use this information to identify reactions that are important to researchers but have been understudied. Text mining also has an ability to discover themes and relationships within a corpus through a technique called topic modeling. In a 2016 research study, the authors use topic modeling to determine the proportion of the analyzed text discussing a specific phenomenon, in this case forest fragmentation, and to determine the concepts that are most strongly associated with this phenomenon (Nunez-Mir et al. 2016). In an example from environmental science, Grubert and Siders (2016) use topic modeling to find empirical support for the theory that climate change has become an important topic in environmental lifecycle assessment over time, and revealed a secondary finding of this increase coming at the expense of attention to human health. Finally, the sheer amount of information available to researchers, educators, and scholars makes it increasingly difficult to stay current on a particular topic or field. Anne Okerson (2013) points out that text mining can be a useful and time saving factor in doing a systematic review. Text mining therefore presents librarians with the opportunity to develop skills in a new area that has the potential to be of great use to patrons.” This will be added to Data Mining Resources Subject Tracer™. This will be added to the tools section of Research Resources Subject Tracer™. This will be added to Entrepreneurial Resources Subject Tracer™.

62 views

Awareness Watch Talk Show for Wednesday May 3, 2017 at 2:00pm EDST – Data Mining Resources 2017

May 03, 2017

Awareness Watch Talk Show for Wednesday May 3, 2017 at 2:00pm EDST – Data Mining Resources 2017
http://www.BlogTalkRadio.com/AwarenessWatch/

This program will be featuring my just updated Data Mining Resources 2017 2017 . We will be highlighting the latest and greatest resources and sources for data mining covering search engines, subject directories, articles, guides and tracers….literally everything on the Internet for DATA MINING!! We will also discussing my latest freely available Awareness Watch Newsletter V15N5 May 2017 featuring Searching the Internet 2017 – The Primer as well as my freely available May 2017 Zillman Column highlighting Privacy Resources 2017. You may call in to ask your questions at (718)508-9839. The show is live and thirty minutes in length starting at 2:00pm EDST on Wednesday, May 3, 2017 and then archived for easy review and access. Listen, Call and Enjoy!!

125 views

Updated> Data Mining Resources 2017 Whitepaper Dataset Link Compilation

April 29, 2017

Updated> Data Mining Resources 2017 Whitepaper Dataset Link Compilation
http://www.DataMiningResources.info/

I have just updated my Data Mining Resources 2017 Subject Tracer™ Whitepaper Dataset Link Compilation and it is now a 33 page (287KB) .pdf white paper document is available from the above URL link. It lists alphabetically the latest resources and sources for data mining available from the Internet.[Completely updated with all links validated and new URLs added on April 29, 2017] Additional white papers and resources by Marcus P. Zillman are available by clicking here.

122 views

Updated> Data Mining Resources 2017 Whitepaper Dataset Link Compilation

February 13, 2017

Updated> Data Mining Resources 2017 Whitepaper Dataset Link Compilation
http://www.DataMiningResources.info/

I have just updated my Data Mining Resources 2017 Subject Tracer™ Whitepaper Dataset Link Compilation and it is now a 33 page (287KB) .pdf white paper document is available from the above URL link. It lists alphabetically the latest resources and sources for data mining available from the Internet.[Completely updated with all links validated and new URLs added on February 13, 2017] Additional white papers and resources by Marcus P. Zillman are available by clicking here.

179 views

Updated> Data Mining Resources 2017 Whitepaper Dataset Link Compilation

December 22, 2016

Updated> Data Mining Resources 2017 Whitepaper Dataset Link Compilation
http://www.DataMiningResources.info/

I have just updated my Data Mining Resources 2017 Subject Tracer™ Whitepaper Dataset Link Compilation and it is now a 33 page (287KB) .pdf white paper document is available from the above URL link. It lists alphabetically the latest resources and sources for data mining available from the Internet.[Completely updated with all links validated and new URLs added on December 22, 2016] Additional white papers and resources by Marcus P. Zillman are available by clicking here.

370 views

Updated> Data Mining Resources 2016 Whitepaper Dataset Link Compilation

September 28, 2016

Updated> Data Mining Resources 2016 Whitepaper Dataset Link Compilation
http://www.DataMiningResources.info/

I have just updated my Data Mining Resources 2016 Subject Tracer™ Whitepaper Dataset Link Compilation and it is now a 33 page (306KB) .pdf white paper document is available from the above URL link. It lists alphabetically the latest resources and sources for data mining available from the Internet.[Completely updated with all links validated and new URLs added on September 28, 2016] Additional white papers and resources by Marcus P. Zillman are available by clicking here.

318 views

July 2016 Zillman Column – Data Mining Resources 2016

June 18, 2016

July 2016 Zillman Column – Data Mining Resources 2016
http://columns.virtualprivatelibrary.net/Data_Mining_Resources_2016_July16_Column.pdf
http://www.zillmancolumns.com/

The July 2016 Zillman Column features Data Mining Resources 2016 by Marcus P. Zillman, M.S., A.M.H.A.; Executive Director of the Virtual Private Library. This is a comprehensive listing of data mining directories, subject guides and index resources and sites available on the Internet. Download this excellent freely available 33 page column 288KB today. These resources and sources will help you to discover the many pathways available through the Internet to find the latest data mining resources and sites. This is another MUST have column to discover these data mining resources in today’s ever changing New Economy world!!

This research is powered by Subject Tracer Bots™ available from the Virtual Private Library™.

437 views

Updated> Data Mining Resources 2016 Whitepaper Dataset Link Compilation

June 01, 2016

Updated> Data Mining Resources 2016 Whitepaper Dataset Link Compilation
http://www.DataMiningResources.info/

I have just updated my Data Mining Resources 2016 Subject Tracer™ Whitepaper Dataset Link Compilation and it is now a 33 page (299KB) .pdf white paper document is available from the above URL link. It lists alphabetically the latest resources and sources for data mining available from the Internet.[Completely updated with all links validated and new URLs added on June 1, 2016] Additional white papers and resources by Marcus P. Zillman are available by clicking here.

456 views

MOA (Massive Online Analysis)

May 07, 2016

MOA (Massive Online Analysis)
http://moa.cms.waikato.ac.nz/

MOA is the most popular open source framework for data stream mining, with a very active growing community (blog). It includes a collection of machine learning algorithms (classification, regression, clustering, outlier detection, concept drift detection and recommender systems) and tools for evaluation. Related to the WEKA project, MOA is also written in Java, while scaling to more demanding problems. MOA performs BIG DATA stream mining in real time, and large scale machine learning. MOA can be extended with new mining algorithms, and new stream generators or evaluation measures. The goal is to provide a benchmark suite for the stream mining community. This will be added to Data mining Resources Subject Tracer™. This will be added to Artificial Intelligence Resources Subject Tracer™.

476 views

SPMF – Open Source Data Mining Library

February 27, 2016

SPMF – Open Source Data Mining Library
http://www.philippe-fournier-viger.com/spmf/

SPMF is an open-source data mining mining library written in Java, specialized in pattern mining. It is distributed under the GPL v3 license. It offers implementations of 112 data mining algorithms for: a) association rule mining; b) itemset mining; c) sequential pattern mining; d) sequential rule mining; e) sequence prediction; f) high-utility pattern mining; and g) clustering and classification. The source code of each algorithm can be integrated in other Java software. Moreover, SPMF can be used as a standalone program with a simple user interface or from the command line. The current version is v0.98 and was released the 14th January 2016. This will be added to Data Mining Resources Subject Tracer™.

540 views