Data Software Engineering Daily

Informações:

Sinopsis

Databases and data engineering episodes of Software Engineering Daily

Episodios

  • Better Stack: A New DevOps Experience with Juraj Masar

    15/07/2021 Duración: 45min

    DevOps has shortened the development life cycle for countless applications and is embraced by companies around the world. But managing and monitoring multiple environments is still a major pain point, particularly when companies need to mix cloud and legacy systems. Knowing when services go down and quickly pinpointing the cause is essential for continuous development.  The post Better Stack: A New DevOps Experience with Juraj Masar appeared first on Software Engineering Daily.

  • Data Science on AWS: Implementing AI and ML Pipelines on AWS with Chris Fregly

    14/07/2021 Duración: 47min

    Data science is an interdisciplinary field that combines strong technical skills with industry knowledge to perform a large range of jobs. Data scientists solve business questions with hands-on work cleaning and analyzing data, building machine learning models and applying algorithms, and generating dynamic visuals and tools to understand the world from the data it generates. The post Data Science on AWS: Implementing AI and ML Pipelines on AWS with Chris Fregly appeared first on Software Engineering Daily.

  • Data Lineage: Understanding Data Lineage at Scale with Julien Le Dem

    12/07/2021 Duración: 59min

    Big Data has exploded the past decade as cloud computing and more efficient hardware made scaling essentially limitless. Products like Uber revolve entirely around analyzing data to provide rides. According to an EMC/IDC study, there was approximately 5.2TB of data for every person in 2020. That estimate was made before the transition to remote work, The post Data Lineage: Understanding Data Lineage at Scale with Julien Le Dem appeared first on Software Engineering Daily.

  • Text Blaze: Text Shortcuts with Scott Fortmann-Roe

    03/07/2021 Duración: 46min

    There are over 4 billion people using email. Many people using email for business communicate quick questions to colleagues, send repetitive, template-based information to potential customers and freshly hired employees, and repeat a lot of the same phrases. We actually repeat phrases in a lot of written formats. How often do you copy and paste The post Text Blaze: Text Shortcuts with Scott Fortmann-Roe appeared first on Software Engineering Daily.

  • LayerCI with Colin Chartier

    02/07/2021 Duración: 42min

    Continuous integration is a coding practice where engineers deliver incremental and frequent code changes to create higher quality software and collaborate more. Teams attempting to continuously integrate new code need a consistent and automated pipeline for reviewing, testing, and deploying the changes. Otherwise change requests pile up in the queue and nothing gets integrated efficiently.  The post LayerCI with Colin Chartier appeared first on Software Engineering Daily.

  • Meltano: ELT for DataOps with Douwe Maan

    01/07/2021 Duración: 56min

    ELT is a process for copying data from a source system into a target system. It stands for “Extract, Load, Transform” and starts with extracting a copy of data from the source location. It’s loaded into the target system like a data warehouse, and then it’s ready to be transformed into a usable format for The post Meltano: ELT for DataOps with Douwe Maan appeared first on Software Engineering Daily.

  • Uber Data Science with Kevin Novak

    24/06/2021 Duración: 50min

    Uber is one of many examples we’ve discussed on this show that has changed the world with big data analysis. With over 8 million users, 1 billion Uber trips and people driving for Uber in over 400 cities and 66 countries, Uber has redefined an entire industry in a very short time frame. It’s difficult The post Uber Data Science with Kevin Novak appeared first on Software Engineering Daily.

  • Axiom Browser Automation with Yaseer Sheriff

    23/06/2021 Duración: 38min

    The quantity and quality of a company’s data can mean the difference between a major success or major failure. Companies like Google have used big data from its earliest days to steer their product suite in the direction consumers need. Other companies, like Apple, didn’t always use big data analytics to drive product design, but The post Axiom Browser Automation with Yaseer Sheriff appeared first on Software Engineering Daily.

  • StreamSets: DataOps and Smart Pipelines with Arvind Prabhakar

    17/06/2021 Duración: 49min

    The company StreamSets is enabling DataOps practices in today’s enterprises. StreamSets is a data engineering platform designed to help engineers design, deploy, and operate smart data pipelines. StreamSets Data Collector is a codeless solution for designing pipelines, triggering CDC operations, and monitoring data in flight. StreamSets Transformer uses Apache Spark to generate insights about your The post StreamSets: DataOps and Smart Pipelines with Arvind Prabhakar appeared first on Software Engineering Daily.

  • Blissfully: Comprehensive IT Management with Aaron White

    16/06/2021 Duración: 55min

    Delivering Saas products involves a lot more than just building the product. Saas management involves customer relationship management, licensing, renewals, maintaining software visibility, and the general management of the technology portfolio.  The company Blissfully helps businesses manage their SaaS products from within a complete IT platform with organization, automation, and security built in. The Blissfully The post Blissfully: Comprehensive IT Management with Aaron White appeared first on Software Engineering Daily.

  • Stemma: Understanding Big Data with Mark Grover

    15/06/2021 Duración: 40min

    Amundsen was started at Lyft and is the leading open-source data catalog with the fastest-growing community and the most integrations. Amundsen enables you to search your entire organization by text search, see automated and curated metadata, share context with co workers, and learn from others by seeing most common queries on a table or frequently The post Stemma: Understanding Big Data with Mark Grover appeared first on Software Engineering Daily.

  • Data Exploration with a New Python Library with Doris Lee

    27/05/2021 Duración: 47min

    Data exploration uses visual exploration to understand what is in a dataset and the characteristics of the data. Data scientists explore data to understand things like customer behavior and resource utilization. Some common programming languages used for data exploration are Python, R, and Matlab.  Doris Jung-Lin Lee is currently a Graduate Research Assistant at the The post Data Exploration with a New Python Library with Doris Lee appeared first on Software Engineering Daily.

  • Firebolt: Data Warehouses with Eldad Farkash

    25/05/2021 Duración: 57min

    Cloud data warehouses are databases hosted in cloud environments. They provide typical benefits of the cloud like flexible data access, scalability, and performance.  The company Firebolt provides a cloud data warehouse built for modern data environments. It decouples storage and compute to operate on top of existing data lakes like S3. It computes orders of The post Firebolt: Data Warehouses with Eldad Farkash appeared first on Software Engineering Daily.

  • Preset: Visualizing Big Data with Srini Kadamati

    20/05/2021 Duración: 53min

    Apache Superset is an open-source, fast, lightweight and modern data exploration and visualization platform. It can connect to any SQL based data source through SQLAlchemy at petabyte scale. Its architecture is highly scalable and it ships with a wide array of visualizations. The company Preset provides a powerful, easy to use data exploration and visualization The post Preset: Visualizing Big Data with Srini Kadamati appeared first on Software Engineering Daily.

  • ClickHouse: Data Warehousing with Robert Hodges

    17/05/2021 Duración: 44min

    Columnar databases store and retrieve columns of data rather than rows of data. Each block of data in a columnar database stores up to 3 times as many records as row-based storage. This means you can read data with a third of the power needed in row-based data, among other advantages. The company Altinity is The post ClickHouse: Data Warehousing with Robert Hodges appeared first on Software Engineering Daily.

  • Apache Hudi: Large Scale Data Systems with Vinoth Chandar

    13/05/2021 Duración: 51min

    Apache Hudi is an open-source data management framework used to simplify incremental data processing and data pipeline development. This framework more efficiently manages business requirements like data lifecycle and improves data quality. Some common use cases for Hudi is record-level insert, update, and delete, simplified file management and near real-time data access, and simplified CDC The post Apache Hudi: Large Scale Data Systems with Vinoth Chandar appeared first on Software Engineering Daily.

  • Akita: Application Programming Interfaces with Jean Yang

    12/05/2021 Duración: 50min

    An application programming interface, API for short, is the connector between 2 applications. For example, a user interface that needs user data will call an endpoint, like a special URL, with request parameters and receive the data back if the request is valid. Modern applications rely on APIs to send data back and forth to The post Akita: Application Programming Interfaces with Jean Yang appeared first on Software Engineering Daily.

  • Nextmv: Optimization in Fluid Work Environments with Carolyn Mooney

    11/05/2021 Duración: 53min

    The traveling salesman problem is a classic challenge of finding the shortest and most efficient route for a person to take given a list of destinations. This is one of many real-world optimization problems that companies encounter. How should they schedule product distribution, or promote product bundles, or define sales territories? The answers to these The post Nextmv: Optimization in Fluid Work Environments with Carolyn Mooney appeared first on Software Engineering Daily.

  • New Relic: Telemetry and Intelligent Observability with Zain Asgar and Ishan Mukherjee

    05/05/2021 Duración: 55min

    In software engineering, telemetry is the data that is collected about your applications. Unlike logging, which is used in the development of apps to pinpoint errors and code flows, telemetry data includes all operational data including logs, metrics, events, traces, usage, and other analytical data. Companies usually visualize this information to troubleshoot problems and understand The post New Relic: Telemetry and Intelligent Observability with Zain Asgar and Ishan Mukherjee appeared first on Software Engineering Daily.

página 5 de 5