AZSecure-data.org – Intelligence and Security Informatics Data Sets
Data Science Testbed for Security Researchers. This portal is available to the ISI community to support research. This service started by offering browsing access to downloadable forums from the Artificial Intelligence Lab’s Dark Web and Geo Web collections, which presently includes nearly 40 million postings. Each forum collection contains millions of postings from hundreds of thousands of authors, and may be in English, Arabic, French, German, Indonesian, Pashto, Russian or Urdu, depending on the forum. The repository also includes a large collection of Internet phishing websites from the University of Virginia, with collections of Escrow, Financial, and Pharmacy sites. Recent additions to the repository include hacker forums in English and Russian, Chinese underground market forums, and chat logs that can be used in the study of underground behavior and how hackers learn from each other, the formation of social networks, relationships with the underground economy, and more. The Patriot, militia, hate and linked websites collection based off the Southern Poverty Law Center’s 2009 list can be used to study rhetoric and communication, group dynamics, extreme social movements, and other topics, in information and the social sciences. All data sets can be downloaded freely for non-commercial education and research use. his NSF-funded Data Infrastructure Building Blocks project is intended to address a large gap in the availability of open source research data for researchers in ISI. The University of Arizona Artificial Intelligence Lab and its partners, the University of Virginia, the University of Texas at Dallas, Drexel University, and the University of Utah, were awarded $1,499,531 for a three-year Pilot Demonstration Project to make available a significant archive of data and analysis tools to serve the ISI community. The primary focus of the AZSecure-data project is on data collection and management, access, and data analysis. The goal is to identify and collect data that will be of the highest interest to the research community and providing the data to the community in the easiest, most useful, and most direct way possible. This will be added to Deep Web Research and Discovery Resources 2020. This will be added to Business Intelligence Resources Subject Tracer™. This will be added to Entrepreneurial Resources Subject Tracer™. This will be added to the tools section of Research Resources Subject Tracer™ .