Data Software Engineering Daily
- Author: Various
- Narrator: Various
- Publisher: Podcast
- Duration: 79:12:22
Information:
Synopsis
Databases and data engineering episodes of Software Engineering Daily
Episodes
-
Better Stack: A New DevOps Experience with Juraj Masar
15/07/2021 Duration: 45 min
DevOps has shortened the development life cycle for countless applications and is embraced by companies around the world. But managing and monitoring multiple environments is still a major pain point, particularly when companies need to mix cloud and legacy systems. Knowing when services go down and quickly pinpointing the cause is essential for continuous development.
-
Data Science on AWS: Implementing AI and ML Pipelines on AWS with Chris Fregly
14/07/2021 Duration: 47 min
Data science is an interdisciplinary field that combines strong technical skills with industry knowledge to perform a wide range of jobs. Data scientists answer business questions through hands-on work: cleaning and analyzing data, building machine learning models and applying algorithms, and generating dynamic visuals and tools to understand the world from the data it generates.
-
Data Lineage: Understanding Data Lineage at Scale with Julien Le Dem
12/07/2021 Duration: 59 min
Big data has exploded over the past decade as cloud computing and more efficient hardware made scaling essentially limitless. Products like Uber revolve entirely around analyzing data to provide rides. According to an EMC/IDC study, there was approximately 5.2TB of data for every person in 2020. That estimate was made before the transition to remote work, ...
-
Text Blaze: Text Shortcuts with Scott Fortmann-Roe
03/07/2021 Duration: 46 min
There are over 4 billion people using email. Many people who use email for business send quick questions to colleagues, send repetitive, template-based information to potential customers and freshly hired employees, and repeat a lot of the same phrases. We actually repeat phrases in a lot of written formats. How often do you copy and paste ...
-
LayerCI with Colin Chartier
02/07/2021 Duration: 42 min
Continuous integration is a coding practice where engineers deliver incremental and frequent code changes to create higher quality software and collaborate more. Teams attempting to continuously integrate new code need a consistent and automated pipeline for reviewing, testing, and deploying the changes. Otherwise, change requests pile up in the queue and nothing gets integrated efficiently.
-
Meltano: ELT for DataOps with Douwe Maan
01/07/2021 Duration: 56 min
ELT is a process for copying data from a source system into a target system. It stands for “Extract, Load, Transform” and starts with extracting a copy of data from the source location. The data is loaded into a target system such as a data warehouse, and then it’s ready to be transformed into a usable format for ...
-
Uber Data Science with Kevin Novak
24/06/2021 Duration: 50 min
Uber is one of many examples we’ve discussed on this show of a company that has changed the world with big data analysis. With over 8 million users, 1 billion Uber trips, and people driving for Uber in over 400 cities and 66 countries, Uber has redefined an entire industry in a very short time frame. It’s difficult ...
-
Axiom Browser Automation with Yaseer Sheriff
23/06/2021 Duration: 38 min
The quantity and quality of a company’s data can mean the difference between a major success and a major failure. Companies like Google have used big data from their earliest days to steer their product suites in the direction consumers need. Other companies, like Apple, didn’t always use big data analytics to drive product design, but ...
-
StreamSets: DataOps and Smart Pipelines with Arvind Prabhakar
17/06/2021 Duration: 49 min
The company StreamSets is enabling DataOps practices in today’s enterprises. StreamSets is a data engineering platform designed to help engineers design, deploy, and operate smart data pipelines. StreamSets Data Collector is a codeless solution for designing pipelines, triggering CDC operations, and monitoring data in flight. StreamSets Transformer uses Apache Spark to generate insights about your ...
-
Blissfully: Comprehensive IT Management with Aaron White
16/06/2021 Duration: 55 min
Delivering SaaS products involves a lot more than just building the product. SaaS management involves customer relationship management, licensing, renewals, maintaining software visibility, and the general management of the technology portfolio. The company Blissfully helps businesses manage their SaaS products from within a complete IT platform with organization, automation, and security built in. The Blissfully ...
-
Stemma: Understanding Big Data with Mark Grover
15/06/2021 Duration: 40 min
Amundsen was started at Lyft and is the leading open-source data catalog with the fastest-growing community and the most integrations. Amundsen enables you to search your entire organization by text search, see automated and curated metadata, share context with coworkers, and learn from others by seeing the most common queries on a table or frequently ...
-
Data Exploration with a New Python Library with Doris Lee
27/05/2021 Duration: 47 min
Data exploration uses visualization to understand what is in a dataset and the characteristics of the data. Data scientists explore data to understand things like customer behavior and resource utilization. Some common programming languages used for data exploration are Python, R, and Matlab. Doris Jung-Lin Lee is currently a Graduate Research Assistant at the ...
-
Firebolt: Data Warehouses with Eldad Farkash
25/05/2021 Duration: 57 min
Cloud data warehouses are databases hosted in cloud environments. They provide typical benefits of the cloud like flexible data access, scalability, and performance. The company Firebolt provides a cloud data warehouse built for modern data environments. It decouples storage and compute to operate on top of existing data lakes like S3. It computes orders of ...
-
Preset: Visualizing Big Data with Srini Kadamati
20/05/2021 Duration: 53 min
Apache Superset is an open-source, fast, lightweight, and modern data exploration and visualization platform. It can connect to any SQL-based data source through SQLAlchemy at petabyte scale. Its architecture is highly scalable and it ships with a wide array of visualizations. The company Preset provides a powerful, easy-to-use data exploration and visualization ...
-
ClickHouse: Data Warehousing with Robert Hodges
17/05/2021 Duration: 44 min
Columnar databases store and retrieve columns of data rather than rows of data. Each block of data in a columnar database stores up to 3 times as many records as row-based storage. This means you can read data with a third of the power needed for row-based storage, among other advantages. The company Altinity is ...
-
Apache Hudi: Large Scale Data Systems with Vinoth Chandar
13/05/2021 Duration: 51 min
Apache Hudi is an open-source data management framework used to simplify incremental data processing and data pipeline development. The framework manages business requirements like data lifecycle more efficiently and improves data quality. Some common use cases for Hudi are record-level insert, update, and delete, simplified file management and near real-time data access, and simplified CDC ...
-
Akita: Application Programming Interfaces with Jean Yang
12/05/2021 Duration: 50 min
An application programming interface, API for short, is the connector between two applications. For example, a user interface that needs user data will call an endpoint, like a special URL, with request parameters and receive the data back if the request is valid. Modern applications rely on APIs to send data back and forth to ...
-
Nextmv: Optimization in Fluid Work Environments with Carolyn Mooney
11/05/2021 Duration: 53 min
The traveling salesman problem is a classic challenge of finding the shortest and most efficient route for a person to take given a list of destinations. This is one of many real-world optimization problems that companies encounter. How should they schedule product distribution, or promote product bundles, or define sales territories? The answers to these ...
-
New Relic: Telemetry and Intelligent Observability with Zain Asgar and Ishan Mukherjee
05/05/2021 Duration: 55 min
In software engineering, telemetry is the data that is collected about your applications. Unlike logging, which is used in the development of apps to pinpoint errors and code flows, telemetry data includes all operational data including logs, metrics, events, traces, usage, and other analytical data. Companies usually visualize this information to troubleshoot problems and understand ...