HPCC is the big data analysis tool from LexisNexis. They have proprietary add-ons as well. They've got a meetup and have spoken at local users' groups. I don't see any upcoming meetups, but keep an eye on them.
This was the note from Trish, their program manager:
Hello, I am the Program Manager for the HPCC Systems open source project at LexisNexis. HPCC Systems (www.hpccsystems.com) from LexisNexis Risk Solutions offers a proven, data-intensive supercomputing platform designed for the enterprise to process and solve Big Data analytical problems. As an alternative to legacy technology, HPCC Systems offers a consistent data-centric programming language, two processing platforms and a single, complete end-to-end architecture for efficient processing. And it is open source! ... I'm happy to provide whitepapers, articles or any other information needed.
Upcoming HPCC Meetups
Guest Speaker: Ft Lauderdale Machine Learning Meetup Group
Big Data Processing & Analytics - HPCC Systems (LexisNexis)
We are excited to sponsor and have a guest speaker at the Ft Lauderdale Machine Learning Meetup group again!
IMPORTANT - please RSVP here:
Moving Towards Deep Learning Algorithms on HPCC Systems
20 minutes - Welcome & Mingling
50 minutes - Presentation
20 minutes - Q&A / Open Discussion
Deep Learning has become a major breakthrough in Artificial Intelligence in recent years. This talk will include a study on implementing L-BFGS, which is a popular optimization algorithm for parameter estimation in machine learning. The speaker will explain how HPCC systems provides the platform as well as the programming language (ECL) in order to implement a parallelized and distributed version of L-BFGS algorithm which work on Big Data. L-BFGS algorithm can be used in not only implementing deep learning algorithms, but also any other Machine Learning algorithm which needs estimating parameters based on optimizing a cost function, such as SVM, Logistic regression, etc.,. At the end the implementations of Sparse Autoencoder algorithm and Softmax classifier are explained as a demonstration on how to use L-BFGS for parameter estimation in such algorithms.
Maryam M. Najafabadi received her B.S. degree in Computer Science from Isfahan University of Technology and her M.S. degree in Artificial Intelligence from Amirkabir University of Technology in 2008 and 2011 respectively. Currently, she is a Ph.D. candidate in the Department of Computer and Electrical Engineering and Computer Science at Florida Atlantic University. Her research interests include Data Mining and Machine Learning. Since 2013 Maryam has been involved in the collaborative research between FAU and LexisNexis where her primary task is implementing machine learning algorithms, focusing on Neural Networks and Deep Learning.
Ft Lauderdale, FL - USA
Wednesday, March 29 at 6:30 PM
The Download: Community Tech Talks Episode 3
Big Data Processing & Analytics - HPCC Systems (LexisNexis)
The HPCC Systems Download - Community Tech Talks!
An workshop series specifically designed for the community by the community to share knowledge, spark innovation and further build and link the relationships within our HPCC Systems community. Each series will feature 20 minute talks from 3-4 speakers.
IMPORTANT: Please register at the Webinar link here.
Episode 3 is scheduled for March 30 at 11am ET.
- Flavio Villanustre, VP Technology, LexisNexis Risk Solutions
- Joselito (Joey) Chua , PhD, Manager Software Engineer, Optimal Decisions Group
• Prescriptive Analytics - a Software Engineering Perspective
Prescriptive tools are key components in realising the value proposition of data analytics and business intelligence solutions. The ultimate goal of analytics is to improve outcomes using insights from data. Descriptive tools summarise what has happened, whereas predictive tools model what is likely to happen. Prescriptive tools, on the other hand, suggest a course of action that will likely achieve the best outcomes. This talk presents an overview of prescriptive techniques involving simulation and optimisation, the engineering challenges in building prescriptive tools, and HPCC solutions for those challenges.
Joselito (Joey) Chua leads the software engineering team in the Optimal Decisions Group in LexisNexis Risk Solutions. He specialises in information-theoretic approaches to machine learning. He is a fan of anime.
- Jill Luber, Senior Architect, LexisNexis Risk Solutions
• Migrating an ECL code repository into Git, Part II
This session will take a quick look at a migration plan that moved ECL production code, production processes and developers out of MySQL/SVN and into a Git code management culture. This includes migrating both Roxie and Thor processes to use Git branches across multiple HPCC Systems environments, all while continuing production data builds and releases.
Jill Luber is a Senior Architect for LexisNexis Risk Solutions with leadership responsibility for strategy, implementation, and stability of all US and international data linking products, including the LexID, Business LexID, Healthcare Provider ID, and UK LexID. With 17 engineers across multiple geographies, the Linking Team develops the core competences underpinning all products at Risk executing on the HPCC big data platform. Jill has presented patented, Risk linking concepts at the 2015 RELX leadership conference as well as the RELX Board meeting. She has been a member of the technology organization for over 13 years.
- Michael Gardner, Software Engineer II, LexisNexis Risk Solutions
• HPCC Systems Platform: Java APIs and tools
This presentation will be in regards to the Java API and tools released by the HPCC Systems Platform team. These projects include wsclient, rdf2hpcc, clienttools, and jdbc. These open source projects, which can be found in the hpcc-systems github repositories, are designed to allow downstream developers a consistent means by which to interface with the HPCC Systems Platform. And to facilitate the workflow of common tasks a downstream developer might be concerned with.
Michael Gardner is a HPCC Systems Platform team member and developer. He is responsible for the HPCC Platform init system, various build issues, administrative scripts, and HPCC Java projects. His most recent active work includes systemd integration for the HPCC Systems Platform, and an antlr3 (c) to antlr4 (cpp) migration for the wssql project.
- Bob Foreman, Senior Software Engineer, HPCC Systems, LexisNexis Risk Solutions
• In Search of the Lost Tutorial – the best ECL lesson you have never seen.
In this presentation, I will explore David Bayliss’ ECL Bible Tutorial, with particular focus on the GRAPH function and building the inverted index for the ROXIE search.
Bob Foreman has worked with the HPCC Systems technology platform and the ECL programming language for over 5 years, and has been a technical trainer for over 25 years. He is the developer and designer of the HPCC Systems Online Training Courses, and is the Senior Instructor for all classroom and Webex/Lync based training.
- Q&A / Adjourn
Submit a talk for an upcoming episode!
Have a new success story to share?
Want to pitch a new use case?
Have a new HPCC Systems application you want to demo?
Want to share some helpful ECL tips and code sample?
Have a new suggestion for the roadmap?
It’s easy! All you need to do is submit a talk title and brief abstract to firstname.lastname@example.org. If chosen, you will be asked to present remotely for an upcoming 20 minute tech talk.
Alpharetta, GA - USA
Thursday, March 30 at 11:00 AM