Posts by Category: Data Mining Resources

Wallmine – Wall Street Data Mining

February 01, 2018

Wallmine – Wall Street Data Mining
https://wallmine.com/

After successfully using their own tools to screen and monitor their clients’ and their stock investments they decided to let anyone benefit from what they have built. They founded wallmine in fall 2017 to help anyone gain insights and understand the abundant stock market data and information. They are fusing their financial expertise with technical talent and bringing an alternative to the expensive, inflexible and old-fashioned tools they had used to track their investments. Their team is small, fast, and they obsessively care about the people using our product. They consider wallmine to be our best investment yet. Aallmine helps you research investment opportunities faster. Use their stock screener, stock charts, heatmaps, cryptocurrency screener, or their portfolios. Their users typically love the fast navigation, comprehensive screeners, data insights, and portfolio tracking. This will be added to Financial Sources 2018 Subject Tracer™. This will be added to Business Intelligence Resources Subject Tracer™. This will be added to Entrepreneurial Resources Subject Tracer™. This will be added to the tools section of Research Resources Subject Tracer™.

45 views

Contentmine – Text and Data Mining Open Source Tools

December 07, 2017

Contentmine – Text and Data Mining Open Source Tools
http://contentmine.org/

Contentmine is a leading text and data mining company, with headquarters in Cambridge, England. We specialise in building open source tools to enable our clients to find facts hidden in information. Content Mine is now established as a leading supplier of text and data mining tools to both Higher Education and Knowledge based organizations , with a broad portfolio of clients. This will be added to Business Intelligence Resources Subject Tracer™. This will be added to Entrepreneurial Resources Subject Tracer™. This will be added to the tools section of Research Resources Subject Tracer™.  This will be added to Data Mining Resources Subject Tracer™. This will be added to Information Quality Resources Subject Tracer™.

109 views

OpenMinted – Open Service Oriented e-Infrastructure for Scientific and Scholarly Text and Data Mining

November 02, 2017

OpenMinted – Open Service Oriented e-Infrastructure for Scientific and Scholarly Text and Data Mining
http://openminted.eu/

OpenMinted sets out to create an open, service-oriented e-Infrastructure for Text and Data Mining (TDM) of scientific and scholarly content. Researchers can collaboratively create, discover, share and re-use Knowledge from a wide range of text-based scientific related sources in a seamless way. This will be added to Data Mining Resources Subject Tracer™. This will be added to the tools section of Research Resources Subject Tracer™.

107 views

Awareness Watch Newsletter V1511 November 2017

October 28, 2017

Awareness Watch Newsletter V1511 November 2017
http://AwarenessWatch.VirtualPrivateLibrary.net/V15N11.pdf
Awareness Watch™ Newsletter Blog and Archives
http://www.AwarenessWatch.com/

The November 2017 V15N11 Awareness Watch Newsletter is a freely available 56 page .pdf document (416KB) from the above URL. This month’s featured report covers my Data Mining Resources 2018 and is a comprehensive listing of data mining search engines, directories, subject guides and index resources and sites on the Internet. The below list of sources is taken from my Subject Tracer™ Information Blog titled Data Mining Resources Resources and is constantly updated with Subject Tracer™ bots at the following URLs: http://www.DataMiningResources.info/. These resources and sources will help you to discover the many pathways available through the Internet to find the latest data mining resources and sites. As this site is constantly updated it would be to your benefit to bookmark and return to the above URL frequently. The Awareness Watch Spotters cover many excellent and newly released annotated current awareness research sources and tools as well as the latest identified Internet happenings and resources including a number of really neat and must-have tools! The Awareness Watch Article Review covers Why Blogs Endure: A Study of Recent College Graduates and Motivations for Blog Readership by Alison J. Head, Michele Van Hoeck, and Kirsten Hostetler.

113 views

GROBID

October 09, 2017

GROBID
https://github.com/kermitt2/grobid

GROBID (or Grobid) means GeneRation Of BIbliographic Data. GROBID is a machine learning library for extracting, parsing and re-structuring raw documents such as PDF into structured TEI-encoded documents with a particular focus on technical and scientific publications. First developments started in 2008 as a hobby. In 2011 the tool has been made available in open source. Work on GROBID has been steady as side project since the beginning and is expected to continue until at least 2020.

258 views

Data Mining Resources 2018 Whitepaper Dataset Link Compilation

October 05, 2017

Data Mining Resources 2018 Whitepaper Dataset Link Compilation
http://www.DataMiningResources.info/

I have just updated my Data Mining Resources 2018 Subject Tracer™ Whitepaper Dataset Link Compilation and it is now a 33 page (286KB) .pdf white paper document is available from the above URL link. It lists alphabetically the latest resources and sources for data mining available from the Internet.[Completely updated with all links validated and new URLs added on October 5, 2017] Additional white papers and resources by Marcus P. Zillman are available by clicking here.

145 views

ELKI: Environment for Developing KDD-Applications Supported by Index-Structures

July 29, 2017

ELKI: Environment for Developing KDD-Applications Supported by Index-Structures
https://elki-project.github.io/

ELKI is an open source (AGPLv3) data mining software written in Java. The focus of ELKI is research in algorithms, with an emphasis on unsupervised methods in cluster analysis and outlier detection. In order to achieve high performance and scalability, ELKI offers data index structures such as the R*-tree that can provide major performance gains. ELKI is designed to be easy to extend for researchers and students in this domain, and welcomes contributions of additional methods. ELKI aims at providing a large collection of highly parameterizable algorithms, in order to allow easy and fair evaluation and benchmarking of algorithms. This will be added to Data Mining Resources Subject Tracer™.

176 views

Mallet – MAchine Learning for LanguagE Toolkit

July 29, 2017

Mallet – MAchine Learning for LanguagE Toolkit
http://mallet.cs.umass.edu/

MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to text. MALLET includes sophisticated tools for document classification: efficient routines for converting text to “features”, a wide variety of algorithms (including Naïve Bayes, Maximum Entropy, and Decision Trees), and code for evaluating classifier performance using several commonly used metrics. In addition to classification, MALLET includes tools for sequence tagging for applications such as named-entity extraction from text. Algorithms include Hidden Markov Models, Maximum Entropy Markov Models, and Conditional Random Fields. These methods are implemented in an extensible system for finite state transducers. This will be added to Data Mining Resources Subject Tracer™. This will be added to Artificial Intelligence Resources Subject Tracer™.

166 views

Deep Learning for Java – Open Source, Distributed, Deep Learning Library for the JVM

July 28, 2017

Deep Learning for Java – Open Source, Distributed, Deep Learning Library for the JVM
https://deeplearning4j.org/

Deeplearning4j is the first commercial-grade, open-source, distributed deep-learning library written for Java and Scala. Integrated with Hadoop and Spark, DL4J is designed to be used in business environments on distributed GPUs and CPUs. Deeplearning4j aims to be cutting-edge plug and play, more convention than configuration, which allows for fast prototyping for non-researchers. DL4J is customizable at scale. Released under the Apache 2.0 license, all derivatives of DL4J belong to their authors. DL4J can import neural net models from most major frameworks via Keras, including TensorFlow, Caffe, Torch and Theano, bridging the gap between the Python ecosystem and the JVM with a cross-team toolkit for data scientists, data engineers and DevOps. This will be added to Data Mining Resources Subject Tracer™. This will be added to Artificial Intelligence Resources Subject Tracer™.

227 views

Weka 3: Data Mining Software in Java

July 28, 2017

Weka 3: Data Mining Software in Java
http://www.cs.waikato.ac.nz/ml/weka/index.html

Weka is a collection of machine learning algorithms for data mining tasks. The algorithms can either be applied directly to a dataset or called from your own Java code. Weka contains tools for data pre-processing, classification, regression, clustering, association rules, and visualization. It is also well-suited for developing new machine learning schemes. Found only on the islands of New Zealand, the Weka is a flightless bird with an inquisitive nature. The name is pronounced like this, and the bird sounds like this. Weka is open source software issued under the GNU General Public License. They have put together several free online courses that teach machine learning and data mining using Weka. Check out the website for the courses for details on when and how to enroll. The videos for the courses are available on Youtube. Yes, it is possible to apply Weka to big data! This will be added to Data Mining Resources Subject Tracer™. This will be added to Artificial Intelligence Resources Subject Tracer™.

173 views