Machine Learning Guide

Informações:

Sinopse

This series aims to teach you the high level fundamentals of machine learning from A to Z. I'll teach you the basic intuition, algorithms, and math. We'll discuss languages and frameworks, deep learning, and more. Audio may be an inferior medium to task; but with all our exercise, commute, and chores hours of the day, not having an audio supplementary education would be a missed opportunity. And where your other resources will provide you the machine learning trees, Ill provide the forest. Additionally, consider me your syllabus. At the end of every episode Ill provide the best-of-the-best resources curated from around the web for you to learn each episodes details.

Episódios

  • MLA 021 Databricks

    22/06/2022 Duração: 25min

    Discussing Databricks with Ming Chang from Raybeam (part of DEPT®)

  • MLA 020 Kubeflow

    29/01/2022 Duração: 01h07min

    Conversation with Dirk-Jan Kubeflow (vs cloud native solutions like SageMaker) Dirk-Jan Verdoorn - Data Scientist at Dept Agency Kubeflow. (From the website:) The Machine Learning Toolkit for Kubernetes. The Kubeflow project is dedicated to making deployments of machine learning (ML) workflows on Kubernetes simple, portable and scalable. Our goal is not to recreate other services, but to provide a straightforward way to deploy best-of-breed open-source systems for ML to diverse infrastructures. Anywhere you are running Kubernetes, you should be able to run Kubeflow. TensorFlow Extended (TFX). If using TensorFlow with Kubeflow, combine with TFX for maximum power. (From the website:) TensorFlow Extended (TFX) is an end-to-end platform for deploying production ML pipelines. When you're ready to move your models from research to production, use TFX to create and manage a production pipeline. Alternatives: Airflow MLflow

  • MLA 019 DevOps

    13/01/2022 Duração: 01h14min

    Chatting with co-workers about the role of DevOps in a machine learning engineer's life Expert coworkers at Dept Matt Merrill - Principal Software Developer Jirawat Uttayaya - DevOps Lead The Ship It Podcast (where Matt features often) Devops tools Terraform Ansible Pictures (funny and serious) Which AWS container service should I use? A visual guide on troubleshooting Kubernetes deployments Public Cloud Services Comparison Killed by Google aCloudGuru AWS curriculum

  • MLA 018 Descript

    07/11/2021 Duração: 06min

    (Optional episode) just showcasing a cool application using machine learning Dept uses Descript for some of their podcasting. I'm using it like a maniac, I think they're surprised at how into it I am. Check out the transcript & see how it performed. Descript The Ship It Podcast How to ship software, from the front lines. We talk with software developers about their craft, developer tools, developer productivity and what makes software development awesome. Hosted by your friends at Rocket Insights. AKA shipit.io Brandbeats Podcast by BASIC An agency podcast with views on design, technology, art, and culture. Explore the new microsite at www.brandbeats.basicagency.com

  • MLA 017 AWS Local Development

    06/11/2021 Duração: 01h04min

    Show notes: ocdevel.com/mlg/mla-17 Developing on AWS first (SageMaker or other) Consider developing against AWS as your local development environment, rather than only your cloud deployment environment. Solutions: Stick to AWS Cloud IDEs (Lambda, SageMaker Studio, Cloud9 Connect to deployed infrastructure via Client VPN Terraform example YouTube tutorial Creating the keys LocalStack Infrastructure as Code Terraform CDK Serverless

  • MLA 016 SageMaker 2

    05/11/2021 Duração: 59min

    Part 2 of deploying your ML models to the cloud with SageMaker (MLOps) MLOps is deploying your ML models to the cloud. See MadeWithML for an overview of tooling (also generally a great ML educational run-down.) SageMaker Jumpstart Deploy Pipelines Monitor Kubernetes Neo

  • MLA 015 SageMaker 1

    04/11/2021 Duração: 46min

    Show notes Part 1 of deploying your ML models to the cloud with SageMaker (MLOps) MLOps is deploying your ML models to the cloud. See MadeWithML for an overview of tooling (also generally a great ML educational run-down.) SageMaker DataWrangler Feature Store Ground Truth Clarify Studio AutoPilot Debugger Distributed Training And I forgot to mention JumpStart, I'll mention next time.

  • MLA 014 Machine Learning Server

    18/01/2021 Duração: 51min

    Server-side ML. Training & hosting for inference, with a goal towards serverless. AWS SageMaker, Batch, Lambda, EFS, Cortex.dev

  • MLA 013 Customer Facing Tech Stack

    03/01/2021 Duração: 46min

    Client, server, database, etc.

  • MLA 012 Docker

    09/11/2020 Duração: 30min

    Use Docker for env setup on localhost & cloud deployment, instead of pyenv / Anaconda. I recommend Windows for your desktop.

  • MLG 032 Cartesian Similarity Metrics

    08/11/2020 Duração: 41min

    Show notes at https://ocdevel.com/mlg/32 Normed distances link A norm is a function that assigns a strictly positive length to each vector in a vector space. link Minkowski is generalized. p_root(sum(xi-yi)^p). "p" = ? (1, 2, ..) for below. L1: Manhattan/city-block/taxicab. abs(x2-x1)+abs(y2-y1). Grid-like distance (triangle legs). Preferred for high-dim space. L2: Euclidean. sqrt((x2-x1)^2+(y2-y1)^2. sqrt(dot-product). Straight-line distance; min distance (Pythagorean triangle edge) Others: Mahalanobis, Chebyshev (p=inf), etc Dot product A type of inner product. Outer-product: lies outside the involved planes. Inner-product: dot product lies inside the planes/axes involved link. Dot product: inner product on a finite dimensional Euclidean space link Cosine (normalized dot)

  • MLA 011 Practical Clustering

    08/11/2020 Duração: 34min

    Kmeans (sklearn vs FAISS), finding n_clusters via inertia/silhouette, Agglomorative, DBSCAN/HDBSCAN

  • MLA 010 NLP packages: transformers, spaCy, Gensim, NLTK

    28/10/2020 Duração: 25min

    NLTK: swiss army knife. Gensim: LDA topic modeling, n-grams. spaCy: linguistics. transformers: high-level business NLP tasks.

  • MLA 009 Charting tools

    06/11/2018 Duração: 24min

    matplotlib, Seaborn, Bokeh, D3, Tableau, Power BI, QlikView, Excel

  • MLA 008 Exploratory Data Analysis

    26/10/2018 Duração: 24min

    EDA + charting. DataFrame info/describe, imputing strategies. Useful charts like histograms and correlation matrices.

  • MLA 007 Jupyter Notebooks

    16/10/2018 Duração: 16min

    Run your code + visualizations in the browser: iPython / Jupyter Notebooks.

  • MLA 006 Salary

    19/07/2018 Duração: 18min

    Salary based on location, gender, age, tech... from O'Reilly.

  • MLA 005 Shapes & Sizes

    09/06/2018 Duração: 26min

    Dimensions, size, and shape of Numpy ndarrays / TensorFlow tensors, and methods for transforming those.

  • MLA 003 Storage: HDF, Pickle, Postgres

    24/05/2018 Duração: 17min

    Comparison of different data storage options when working with your ML models.

  • MLA 002 Numpy & Pandas

    24/05/2018 Duração: 17min

    Some numerical data nitty-gritty in Python.

página 1 de 3