Not just Hadoop, but also other big data and cloud technologies, such as HBase, Hypertable, and Pig. Looks like they meet at various places around town, at various and sundry times. As the dictionary described a pookah in Harvey: it appears now and then, here and there, to this one and that one. Subscribe to the meetup to stay informed.

Atlanta Hadoop Users Group News

Events - Atlanta Hadoop Users Group

  • Atlanta Hadoop Users Group

    Data drift, the gradual morphing of data structure and semantics, is a fact of life in enterprise IT. New requirements force schema changes, the meaning of database columns changes over time, and infrastructure upgrades add new fields to log files. Left unchecked, drift in data sources can cause applications and dataflows to fail, with costly downtime and, in the worst case, corruption in downstream data stores.

    In this session, we'll start by looking at how we can deal with the problem of drift, focusing on the concrete example of replicating a relational database into Hive. We'll then examine some alternative approaches using open source tools such as Sqoop, NiFi and StreamSets Data Collector. Finally, we'll build a simple data pipeline to read the relational schema, create equivalent Hive tables, and then continuously ingest data from the relational database to Hive, altering the Hive schema as columns are added to the source tables.

    Speaker: Pat Patterson has been working with Internet technologies since 1997, building software and communities at Sun Microsystems, Huawei, Salesforce and StreamSets. At Sun, Pat was the community lead for the OpenSSO open source project, while at Huawei he developed cloud storage infrastructure software. As a developer evangelist at Salesforce, Pat focused on identity, integration and the Internet of Things. Now community champion at StreamSets, Pat is responsible for the care and feeding of the StreamSets open source community.

    Dunwoody, GA - USA

    Thursday, April 6 at 7:00 PM

    40

    https://www.meetup.com/Atlanta-Hadoop-Users-Group/events/237675346/
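The last step in the session abstract above (create equivalent Hive tables, then alter the Hive schema as columns are added to the source) can be illustrated with a minimal sketch. This is not the speaker's pipeline and does not use Sqoop, NiFi, or StreamSets Data Collector. It only builds HiveQL strings from column metadata that is assumed to have been read already. The type mapping `HIVE_TYPES` and the function names `to_hive_type`, `create_table_ddl`, and `drift_ddl` are hypothetical names chosen for illustration, and only additive drift (new columns) is handled.

```python
from typing import Dict, Iterable, List

# Hypothetical, deliberately small mapping from source RDBMS types to Hive
# types. A real pipeline would need a mapping specific to its source database.
HIVE_TYPES: Dict[str, str] = {
    "INTEGER": "INT",
    "BIGINT": "BIGINT",
    "VARCHAR": "STRING",
    "TEXT": "STRING",
    "TIMESTAMP": "TIMESTAMP",
}


def to_hive_type(source_type: str) -> str:
    """Map a source column type to a Hive type, falling back to STRING."""
    base = source_type.upper().split("(")[0].strip()  # "VARCHAR(64)" -> "VARCHAR"
    return HIVE_TYPES.get(base, "STRING")


def create_table_ddl(table: str, source_columns: Dict[str, str]) -> str:
    """Build a CREATE TABLE statement mirroring the source columns."""
    cols = ", ".join(
        f"`{name}` {to_hive_type(ctype)}" for name, ctype in source_columns.items()
    )
    return f"CREATE TABLE IF NOT EXISTS {table} ({cols})"


def drift_ddl(
    table: str, source_columns: Dict[str, str], hive_columns: Iterable[str]
) -> List[str]:
    """Return ALTER TABLE statements for source columns missing from Hive.

    Hive column names are case-insensitive, so names are compared in lowercase.
    """
    existing = {name.lower() for name in hive_columns}
    added = [
        (name, ctype)
        for name, ctype in source_columns.items()
        if name.lower() not in existing
    ]
    if not added:
        return []
    cols = ", ".join(f"`{name}` {to_hive_type(ctype)}" for name, ctype in added)
    return [f"ALTER TABLE {table} ADD COLUMNS ({cols})"]


if __name__ == "__main__":
    # Initial source schema (illustrative table and column names).
    source = {"id": "INTEGER", "name": "VARCHAR(64)"}
    print(create_table_ddl("customers", source))

    # Later, the source table gains a column; only the difference is emitted.
    source["email"] = "VARCHAR(255)"
    for statement in drift_ddl("customers", source, ["id", "name"]):
        print(statement)
```

Generating statements as strings keeps the sketch independent of any particular Hive client. Dropped or retyped columns would need separate handling that is not shown here.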

  • Atlanta Hadoop Users Group

    Hadoop and its ancillary tools have matured to the point that all the tools needed to make the platform PCI compliant are available. The talk will touch on PCI requirements and how to map them to the tools and processes that support them. It will also cover different PCI-driven architectural approaches, which offer different trade-offs among 1) simplicity, 2) performance, and 3) auditability.

    Speaker Bio:
    Rommel Garcia is the Security Subject Matter Expert Lead at Hortonworks. He has worked extensively with financial and other organizations to establish compliance with commercially or federally mandated regulations such as PCI. Rommel is also an organizer of AHUG and a nationally recognized speaker.

    This presentation needs a location.

    Dunwoody, GA - USA

    Wednesday, April 26 at 7:00 PM

    33

    https://www.meetup.com/Atlanta-Hadoop-Users-Group/events/238318045/