SHARING BIG DATA SAFELY
ebook

TED DUNNING / ELLEN FRIEDMAN

$294.00
VAT included
Publisher: O'REILLY MEDIA
Subject: COMPUTING
ISBN: 9781491953631
Format: Epublication content package
Language: English
DRM: Yes

Many big data-driven companies today are moving to protect certain types of data against intrusion, leaks, or unauthorized eyes. But how do you lock down data while granting access to people who need to see it? In this practical book, authors Ted Dunning and Ellen Friedman offer two novel and practical solutions that you can implement right away.

Ideal for both technical and non-technical decision makers, group leaders, developers, and data scientists, this book shows you how to:

  • Share original data in a controlled way so that different groups within your organization only see part of the whole. You’ll learn how to do this with the new open source SQL query engine Apache Drill.
  • Provide synthetic data that emulates the behavior of sensitive data. This approach enables external advisors to work with you on projects involving data that you can’t show them.

If you’re intrigued by the synthetic data solution, explore the log-synth program that Ted Dunning developed as open source code (available on GitHub), along with how-to instructions and tips for best practice. You’ll also get a collection of use cases.

Providing lock-down security while safely sharing data is a significant challenge for a growing number of organizations. With this book, you’ll discover new options to share data safely without sacrificing security.

Other books by the authors

  • STREAMING ARCHITECTURE
    TED DUNNING / ELLEN FRIEDMAN
    More and more data-driven companies are looking to adopt stream processing and streaming analytics. With this concise ebook, you’ll learn best practices for designing a reliable architecture that supports this emerging big-data paradigm. Authors Ted Dunning and Ellen Friedman (Real World Hadoop) help you explore some of the best technologies to handle stream processing and anal...

    $294.00

  • REAL-WORLD HADOOP
    TED DUNNING / ELLEN FRIEDMAN
    If you’re a business team leader, CIO, business analyst, or developer interested in how Apache Hadoop and Apache HBase-related technologies can address problems involving large-scale data in cost-effective ways, this book is for you. Using real-world stories and situations, authors Ted Dunning and Ellen Friedman show Hadoop newcomers and seasoned users alike how NoSQL databases...

    $284.00

  • PRACTICAL MACHINE LEARNING: INNOVATIONS IN RECOMMENDATION
    TED DUNNING / ELLEN FRIEDMAN
    Building a simple but powerful recommendation system is much easier than you think. Approachable for all levels of expertise, this report explains innovations that make machine learning practical for business production settings—and demonstrates how even a small-scale development team can design an effective large-scale recommendation system. Apache Mahout committers Ted Dunnin...

    $254.00

  • PRACTICAL MACHINE LEARNING: A NEW LOOK AT ANOMALY DETECTION
    TED DUNNING / ELLEN FRIEDMAN
    Finding Data Anomalies You Didn't Know to Look For. Anomaly detection is the detective work of machine learning: finding the unusual, catching the fraud, discovering strange activity in large and complex datasets. But, unlike Sherlock Holmes, you may not know what the puzzle is, much less what “suspects” you’re looking for. This O’Reilly report uses practical examples to explain...

    $254.00
