Quickly build and deploy massive data pipelines and improve productivity using Azure DatabricksKey FeaturesGet to grips with the distributed training and deployment of machine learning and deep learning modelsLearn how ETLs are integrated with Azure Data Factory and Delta LakeExplore deep learning and machine learning models in a distributed computing infrastructureBook DescriptionMicrosoft Azure Databricks helps you to harness the power of distributed computing and apply it to create robust data pipelines, along with training and deploying machine learning and deep learning models.
Use PySpark to easily crush messy data at-scale and discover proven techniques to create testable, immutable, and easily parallelizable Spark jobsKey FeaturesWork with large amounts of agile data using distributed datasets and in-memory cachingSource data from all popular data hosting platforms, such as HDFS, Hive, JSON, and S3Employ the easy-to-use PySpark API to deploy big data Analytics for productionBook DescriptionApache Spark is an open source parallel-processing framework that has been around for quite some time now.
This book describes a set of methods, architectures, and tools to extend the data pipeline at the disposal of developers when they need to publish and consume data from Knowledge Graphs (graph-structured knowledge bases that describe the entities and relations within a domain in a semantically meaningful way) using SPARQL, Web APIs, and JSON.
Get unique insights from your data by combining the power of SQL Server, R and PythonKey FeaturesUse the features of SQL Server 2017 to implement the data science project life cycleLeverage the power of R and Python to design and develop efficient data modelsfind unique insights from your data with powerful techniques for data preprocessing and analysisBook DescriptionSQL Server only started to fully support data science with its two most recent editions.
Discover the power of location data to build effective, intelligent data models with Geospatial ecosystemsKey FeaturesManipulate location-based data and create intelligent geospatial data modelsBuild effective location recommendation systems used by popular companies such as UberA hands-on guide to help you consume spatial data and parallelize GIS operations effectivelyBook DescriptionData scientists, who have access to vast data streams, are a bit myopic when it comes to intrinsic and extrinsic location-based data and are missing out on the intelligence it can provide to their models.
Add Data and Analytics to Your TD ToolkitInstructional design pro Megan Torrance addresses the importance of instructional designers accessing and applying learning and performance datafrom how to design learning experiences with data collection in mind to how to use the data to improve and evaluate those experiences.
Application Performance Management (APM) in the Digital Enterprise enables IT professionals to be more successful in managing their company's applications.
More stimulating mathematics puzzles from bestselling author Paul NahinHow do technicians repair broken communications cables at the bottom of the ocean without actually seeing them?
Learn through hands-on exercises covering a variety of topics including data connections, analytics, and dashboards to effectively prepare for the Tableau Desktop Certified Associate examKey FeaturesPrepare for the Tableau Desktop Certified Associate exam with the help of tips and techniques shared by expertsImplement Tableau's advanced analytical capabilities such as forecastingDelve into advanced Tableau features and explore best practices for building dashboardsBook DescriptionThe Tableau Desktop Certified Associate exam measures your knowledge of Tableau Desktop and your ability to work with data and data visualization techniques.
Perform advanced dashboard, visualization, and analytical techniques with Tableau Desktop, Tableau Prep, and Tableau ServerKey FeaturesUnique problem-solution approach to aid effective business decision-makingCreate interactive dashboards and implement powerful business intelligence solutionsIncludes best practices on using Tableau with modern cloud analytics servicesBook DescriptionTableau has been one of the most popular business intelligence solutions in recent times, thanks to its powerful and interactive data visualization capabilities.
Think about your data intelligently and ask the right questionsKey FeaturesMaster data cleaning techniques necessary to perform real-world data science and machine learning tasksSpot common problems with dirty data and develop flexible solutions from first principlesTest and refine your newly acquired skills through detailed exercises at the end of each chapterBook DescriptionData cleaning is the all-important first step to successful data science, data analysis, and machine learning.
Quickly build and deploy massive data pipelines and improve productivity using Azure DatabricksKey FeaturesGet to grips with the distributed training and deployment of machine learning and deep learning modelsLearn how ETLs are integrated with Azure Data Factory and Delta LakeExplore deep learning and machine learning models in a distributed computing infrastructureBook DescriptionMicrosoft Azure Databricks helps you to harness the power of distributed computing and apply it to create robust data pipelines, along with training and deploying machine learning and deep learning models.
Speed up the design and implementation of deep learning solutions using Apache SparkKey FeaturesExplore the world of distributed deep learning with Apache SparkTrain neural networks with deep learning libraries such as BigDL and TensorFlowDevelop Spark deep learning applications to intelligently handle large and complex datasetsBook DescriptionDeep learning is a subset of machine learning where datasets with several layers of complexity can be processed.
This graduate textbook presents fundamentals, applications and evaluation of image segregation, unit description, feature measurement and pattern recognition.
Kickstart your NLP journey by exploring BERT and its variants such as ALBERT, RoBERTa, DistilBERT, VideoBERT, and more with Hugging Face's transformers libraryKey FeaturesExplore the encoder and decoder of the transformer modelBecome well-versed with BERT along with ALBERT, RoBERTa, and DistilBERTDiscover how to pre-train and fine-tune BERT models for several NLP tasksBook DescriptionBERT (bidirectional encoder representations from transformer) has revolutionized the world of natural language processing (NLP) with promising results.
Gain hands-on experience with industry-standard data analysis and machine learning tools in PythonKey FeaturesTackle data science problems by identifying the problem to be solvedIllustrate patterns in data using appropriate visualizationsImplement suitable machine learning algorithms to gain insights from dataBook DescriptionData Science Projects with Python is designed to give you practical guidance on industry-standard data analysis and machine learning tools, by applying them to realistic data problems.
Distributed Systems: Concurrency and Consistency explores the gray area of distributed systems and draws a map of weak consistency criteria, identifying several families and demonstrating how these may be implemented into a programming language.
Perform efficient fast text representation and classification with Facebook's fastText libraryKey FeaturesIntroduction to Facebook's fastText library for NLPPerform efficient word representations, sentence classification, vector representationBuild better, more scalable solutions for text representation and classificationBook DescriptionFacebook's fastText library handles text representation and classification, used for Natural Language Processing (NLP).
This graduate textbook explains image reconstruction technologies based on region-based binocular and trinocular stereo vision, and object, pattern and relation matching.
Enter the exciting world of Julia, a high-performance language for technical computingKey FeaturesLeverage Julia's high speed and efficiency for your applicationsWork with Julia in a multi-core, distributed, and networked environmentApply Julia to tackle problems concurrently and in a distributed environmentBook DescriptionThe release of Julia 1.
A comprehensive guide to mastering the most advanced Hadoop 3 conceptsKey FeaturesGet to grips with the newly introduced features and capabilities of Hadoop 3Crunch and process data using MapReduce, YARN, and a host of tools within the Hadoop ecosystemSharpen your Hadoop skills with real-world case studies and codeBook DescriptionApache Hadoop is one of the most popular big data solutions for distributed storage and for processing large chunks of data.
Implement business intelligence (BI), data modeling, and data analytics within Microsoft products such as Power BI, SQL Server, and ExcelKey FeaturesUnderstand the ins and outs of DAX expressions and querying functions with the help of easy-to-follow examplesManipulate data of varying complexity and optimize BI workflows to extract key insightsCreate, monitor, and improve the performance of models by writing clean and robust DAX queriesBook DescriptionData Analysis Expressions (DAX) is known for its ability to increase efficiency by extracting new information from data that is already present in your model.
Gain practical insights by exploiting data in your business to build advanced predictive modeling applications Key Features A step-by-step guide to predictive modeling including lots of tips, tricks, and best practices Learn how to use popular predictive modeling algorithms such as Linear Regression, Decision Trees, Logistic Regression, and Clustering Master open source Python tools to build sophisticated predictive models Book Description Social Media and the Internet of Things have resulted in an avalanche of data.
This book is written to address the issues relating to data gathering, data warehousing, and data analysis, all of which are useful when working with large amounts of data.
Get up and running with Oracle's premium cloud blockchain services and build distributed blockchain apps with easeKey FeaturesDiscover Hyperledger Fabric and its components, features, qualifiers, and architectureGet familiar with the Oracle Blockchain Platform and its unique featuresBuild Hyperledger Fabric-based business networks with Oracle's premium blockchain cloud serviceBook DescriptionHyperledger Fabric empowers enterprises to scale out in an unprecedented way, allowing organizations to build and manage blockchain business networks.
Power BI Data Analysis and Visualization provides a roadmap to vendor choices and highlights why Microsoft's Power BI is a very viable, cost effective option for data visualization.
Design cost-efficient database solutions, scale enterprise operations and reduce overhead business costs with MySQLKey FeaturesExplore the new and advanced features of MySQL 8.
This is the first comprehensive book dedicated entirely to the field of decision trees in data mining and covers all aspects of this important technique.
Leverage the power of Tableau to get actionable business insights and make better business decisionsKey FeaturesExplore all the new features of Tableau 2018.
Introduction to deep learning and PyTorch by building a convolutional neural network and recurrent neural network for real-world use cases such as image classification, transfer learning, and natural language processing.