LivingDataLab - LivingDataLab AI Technical Blog

Langchain and OpenAI Functions - Conversational agents

In this post we will explore the advanced concept of conversational agents. Building upon our previous articles, we delve into creating a conversational future using Large Language Models (LLMs), blending tool usage with chat memory, akin to the workings of OpenAI’s ChatGPT. This article serves as a comprehensive guide to understanding and implementing these agents.

Nov 8, 2023

Langchain and OpenAI Functions - Tools and Routing

This article will delve into the mechanics of OpenAI’s functions, how to use it to select the appropriate tools for an LLM, and how to execute them efficiently. This can give you the knowledge to create your own tools tailored to specific tasks and to utilize them effectively within the OpenAI framework and Langchain to create powerful LLM based applications.

Nov 7, 2023

Enhancing Data Structuring through Tagging and Extraction with OpenAI and LangChain

Structured data extraction is increasingly becoming an essential tool for developers who wish to harness the power of OpenAI’s capabilities. This blog post aims to provide an understanding of how developers use OpenAI functions for tagging and extraction - two primary use cases central to transforming unstructured text into structured, actionable data.

Nov 6, 2023

OpenAI Function Calling In LangChain

This article focuses on the integration of OpenAI functions with Langchain’s expression language and how this makes applications quicker to produce. We will also delve into the utility of PyDantic, a Python library that simplifies the construction of OpenAI functions.

Nov 5, 2023

An Introduction to LangChain Expression Language (LCEL)

Here we introduce is a new syntax LangChain Expression Language (LCEL) that makes it much easier and more transparent to construct and work with different LLM chains and agents for LLM based applications

Nov 4, 2023

Unlocking Function Calls with OpenAI - A Step-by-Step Guide to Augmenting OpenAI Language Models

OpenAI’s API comes with a range of exciting features, one of which is the ability to call functions. This capability, introduced a few months back, opens up a realm of possibilities, making interactions with the llm-based applications more dynamic and informative.

Nov 3, 2023

AI and the Meta-Crisis - How AI is likely to play a major role in the multiple threats humanity faces and their possible solutions

Humanity finds itself on the precipice of multiple self-induced existential threats such as climate change and conflicts. A challenge is that solutions to some problems might make other problems worse. This article introduces the concept of the Meta-Crisis, and describes how AI plays an essential role in both problems and solutions

Oct 15, 2023

Using LLMs and Langchain to Ask Questions about SQL Databases

Some of the most valuable information to make LLMs useful is in structured data such as atabases. In this article we show how we can use langchain to help LLMs answer questions based on information stored in an SQL database.

Aug 20, 2023

Comparing Question and Answer LLM System Outputs

In this article we show how to use labeled preference scoring to help compare two versions of a system and choose the preferred outputs

Aug 19, 2023

Evaluating Question and Answer Systems with Dynamic Data

In many real-world settings, the proper answer to a question may alter over time. For example, if you’re designing a Q&A system on top of a database or that connects to an API, the underlying data may be updated regularly. Instead of storing labels directly as values, we’ll utilise references to overcome this issue in this post using Langsmith where our labels will be references to look up the relevant values.

Aug 18, 2023

Measuring the Accuracy of an LLM based Question and Answering System

Evaluating a question and response system can help you improve its system design as well as the prompt and model quality. We tend to improve what we can measure, therefore verifying for correctness is a key focus. In this post, we will utilise LangSmith to test the accuracy of a Q&A system against an example dataset.

Aug 17, 2023

Langsmith for LLM Application Evaluation & Monitoring

Developing LLM based applications is now possible using libraries such as Langchain, but taking these applications into production can involve many challenges such as evaluation & monitoring. Langsmith is a new tool that can help with these challenges of taking LLMs into production.

Aug 16, 2023

Guarding Against Undesirable LLM Outputs with the Self-Critique Chain

While language models have remarkable capabilities they can occasionally generate undesirable outputs. Here we addresses this issue by introducing the self-critique chain which acts as a mechanism to ensure model responses are appropriate in a production environment.

Aug 15, 2023

Creating a Voice Assistant for your Knowledge Base

In this article we are going to create a voice assistant for your knowledge base! This will outline how you can develop your very own voice assistant employing state-of-the-art artificial intelligence tools

Aug 14, 2023

A YouTube Video Summarizer Using Whisper and LangChain

In this post we dive into the challenge of summarizing YouTube videos efficiently in the context of the digital age. It will introduce two cutting-edge tools, Whisper and LangChain, that can help tackle this issue. We will discuss the strategies of “stuff,” “map-reduce,” and “refine” for handling large amounts of text and extracting valuable information.

Aug 13, 2023

Chains and why they are used in Langchain

In this post we delve deeper into the concept of chains, which provide an end-to-end pipeline for utilizing language models. These chains seamlessly integrate models, prompts, memory, parsing output, and debugging capabilities, offering a user-friendly interface.

Aug 12, 2023

Building a Customer Support Question Answering Chatbot

Here, we show how to use website material as additional context to help a chatbot efficiently reply to user queries. The code implementation uses data loaders, and stores the associated embeddings in the Deep Lake dataset, and then retrieves the documents that are most relevant to the user’s query.

Aug 11, 2023

Exploring Embeddings for Large Language Models

High-dimensional vectors called embeddings are used to store semantic data. Textual data can be transformed into embedding space by large language models, enabling flexible representations across languages. These embeddings act as useful tools for identifying relevant information.

Aug 10, 2023

Text Splitters for Retrieval and Large Language Models

Giving documents to the LLM as information sources and asking it to produce an answer based on the information it extracts from the document is one strategy for reducing hallucinations - in this article we will look at how text splitters can help with this

Aug 9, 2023

Streamlined Data Ingestion for LLMs

The LangChain library provides a number of assistance classes that are intended to make it easier to load and extract data from various sources which we will cover in this post

Aug 8, 2023

Exploring The Role of LangChain’s Indexes and Retrievers

With an emphasis on the function of indexes and retrievers - here we will examine some of the benefits and drawbacks of employing document-based LLMs that use these

Aug 7, 2023

Creating Knowledge Graphs from Textual Data and LLM’s

Here we walk through a simple workflow for creating a knowledge graph from textual data, making complex information more accessible and easier to understand

Aug 6, 2023

An Improved News Articles Summarizer

Our goal in this post is to improve a news summarisers ability to extract the most important information from lengthy news items and display it in an easy-to-read bulleted list format

Aug 5, 2023

Managing Large Language Model Outputs with Parsers

This article covers the different types of parsing objects used for LLMs and the troubleshooting processing

Aug 4, 2023

Getting the Best of Few Shot Prompts and Example Selectors for LLMs

In this article, we’ll examine how example selectors and few-shot prompts might improve LangChain’s language model performance

Aug 3, 2023

Using Prompt Templates with Large Language Models

This article explores the subtleties of PromptTemplates and efficient ways to use them. A PromptTemplate is a pre-established pattern or framework used to create efficient and dependable prompts for extensive language models - it serves as a guide to make sure the input text or prompt is formatted correctly

Aug 2, 2023

Prompt Engineering Tips and Tricks for Large Language Models

The aim of this post is to provide a strong basis in the knowledge and techniques required to develop effective prompts that empower LLMs to provide precise, contextually relevant, and insightful responses.

Aug 1, 2023

Popular Large Language Models Compared

We will examine the integration of various LLM models in LangChain in this article

Jul 31, 2023

Using the Open Source GPT4All Large Language Model

There are many Large Language Models many are not fully accesible - access to the weights and architecture of these models is limited, and even if one does it requires a large amount of resources to carry out any activities and building on top of these APIs is not free. Open-source models like GPT4All get over these limitations and increase everyones access to the LLMs

Jul 30, 2023

A News Article Summariser with OpenAI and Langchain

In this project we create a News Articles Summarizer application utilising ChatGPT and LangChain to help save time staying current on news and information in the fast-paced world of today

Jul 29, 2023

Large Language Models v Chat Models

In LangChain LLMs and Chat Models are two different kinds of models that are used for various tasks involving natural language processing - the distinctions between LLMs and Chat Models as well as their distinctive applications and implementation strategies within LangChain will be covered in this article

Jul 28, 2023

The Activeloop Deep Lake Vector Store for Agents & Large Language Models

Activeloop Deep Lake provides storage for embeddings and their corresponding metadata in the context of LLM apps, and enables hybrid searches on these embeddings and their attributes for efficient data retrieval and integrates with LangChain and Agents

Jul 27, 2023

LLaMa-2 70B Chatbot in Hugging Face and LangChain

In this article we will look at how we can use the open source Llama-70b-chat model in both Hugging Face transformers and LangChain

Jul 26, 2023

Chat with Your Data using Memory and Langchain

In this article we are going to give a chatbot memory to help it better ask questions about data using langchain.

Jul 25, 2023

Questioning and Answering over Data with LangChain

In this article we look at how you can split documents extract the relevant data take a question and pass them both to a language model, and ask it to answer the question using Langchain.

Jul 24, 2023

Advanced Vectorstore Retrieval using LangChain

In this article we look at how you can retrieve content from a vectorstore using state-of-the-art methods to ensure only the most relevant content is made available for Large Language Models.

Jul 23, 2023

Vectorstores and Embeddings with LangChain

In this article we look at how to convert documents into vector stores an embeddings as an important step in making content available for Large Language Models.

Jul 22, 2023

Document Splitting with LangChain

In this article we look at how you can split documents as an important step in making content available for Large Language Models

Jul 21, 2023

LLM Application Considerations - Part 2

In this post we look at several aspects to consider when deploying a Large Language Model (LLM) into an application such as chain-of-thought reasoning, program-aided language models (PAL), the REAct framework combining reason and action, application architectures, and responsible AI.

Jul 20, 2023

LLM Application Considerations - Part 1

In this post we look at several aspects to consider when deploying a Large Language Model (LLM) into an application such as Model optimizations, a Generative AI project lifecycle cheat sheet, and how LLM’s can be turned into useful applications using external data sources and services.

Jul 19, 2023

Fine-Tuning FLAN-T5 with Reinforcement Learning (PPO) and PEFT to Generate Less-Toxic Summaries

In this project we will fine-tune a FLAN-T5 model to generate less toxic content with Meta AI’s hate speech reward model

Jul 18, 2023

Reinforcement learning from human feedback (RLHF) using Proximal Policy Optimisation

In this post we will look at Proximal Policy Optimization which is a powerful algorithm for solving reinforcement learning problems

Jul 17, 2023

Reinforcement learning from human feedback (RLHF) For LLMs - Part 2

Here we look at more advanced aspects of Reinforcement learning from human feedback (RLHF) in particular the reward model, use of chain-of-thought prompting and looking at the challenges LLMs face with knowledge cut-offs

Jul 16, 2023

Reinforcement learning from human feedback (RLHF) For LLMs - Part 1

In this post we will introduce Reinforcement learning from human feedback (RLHF) which is an important method used in modern large language models to help improve the performance and alignment of large language models.

Jul 15, 2023

Fine-Tuning a Generative AI Model for Dialogue Summarization

In this project I will fine-tune an existing Large Language Model from Hugging Face for enhanced dialogue summarization

Jul 14, 2023

Parameter Efficient Fine-Tuning (PEFT) for Large Language Models

Training large language models can be computationally and financially expensive. Parameter efficient fine tuning techniques only modify a restricted number of parameters and can result in drastically reduce costs and training time.

Jul 13, 2023

Evaluating Fine-Tuned Large Language Models

In this article we explore several metrics that are used by developers of large language models that you can use to assess the performance of your own models and compare to other models out in the world

Jul 12, 2023

Multi-task Instruction Fine-Tuning for Large Language Models

In this post, we’ll look at techniques you might employ to make an existing large language model more effective for your particular use case using a method called instruction fine-tuning, and in particular see how this can be used to optimise for multiple tasks as the same time.

Jul 11, 2023

Improve Large Language Models with Instruction Fine-Tuning

In this article we will look at methods that you can use to improve the performance of an existing large language model for your specific use case using instruction fine-tuning

Jul 10, 2023

Pre-training Large Language Models for Domain Adaptation

Here we will examine particular use cases where it might make sense to train a large language model from scratch. These use cases are often characterised by situations that use language in a very unique way such as legal or medical text

Jul 9, 2023

Scaling Laws and Compute Optimal Large Language Models

In this article we look at research that has looked at the relationship between model size, training, configuration, and performance to try to pinpoint the optimal size for large language models

Jul 8, 2023

Computational Challenges fo training LLMs

Running out of memory is one of the most frequent problems you still encounter when trying to train large language models. In this article we look at strategies used to help train these models more efficiently.

Jul 7, 2023

Choosing a Pre-Trained Large Language Model

In this article we will look at different types of pre-trained models and see how these are suited for different tasks - this can help you choose the best model for your LLM use-case

Jul 6, 2023

Summarising Dialogue using Generative AI

Here I will explore dialogue summarization using generative AI and will look at how the input text affects the output of the model and use prompt engineering to direct it towards the task we need

Jul 5, 2023

An Approach to the Generative AI Project Lifecyle

In this article I will present a high level project architecture for building Generative AI projects that could be applied to any project proposed by DeepLearning AI and AWS in their Generative AI with Large Language Models Course

Jul 4, 2023

Generative Configuration for Large Language Models

In this article we will take a high level non-technical view of what generative configuration options for Large language models allow you to do

Jul 3, 2023

A High Level Overview of Prompting and In-Context Learning for Large Language Models

Here we will take a high level non-technical view of what prompting is all about and introduce in-context learning

Jul 2, 2023

A High Level Overview of the Transformer Model - The Magic Behind Recent Advances in AI

In this article we will take a high level non-technical view of key aspects of the Transformer Model - the technology behind recent advances in AI

Jul 1, 2023

Evaluating the outputs of Large Language Model Applications for Ambiguous Criteria

Here we look at some best practices for evaluating the outputs of an LLM application when you do not have a clear sense of the right output or its ambiguous - to help us know before and after deployment how well its working

Jun 26, 2023

Evaluating the outputs of Large Language Model Applications for Clear Criteria

Here we look at some best practices for evaluating the outputs of an LLM application when you do have a clear sense of the right output - to help us know before and after deployment how well its working

Jun 25, 2023

Creating Better Chatbots using Chained Prompts and Quality Checks

Here, we will put together chained prompts, moderation and other quality checks to create a better customer services chatbot using ChatGPT

Jun 24, 2023

Checking Outputs of Large Language Models like ChatGPT

In this article we will focus on checking outputs generated by an LLM before showing them to users - which can be important for ensuring the quality, relevance, and safety of the responses provided to them or used in automation flows

Jun 23, 2023

Chaining Multiple Prompts together using ChatGPT for Better Task Execution

Here using ChatGPT we will see how to split complex tasks into a series of simpler subtasks by chaining multiple prompts together which can help provide better results than trying to perform a task using just one prompt

Jun 22, 2023

Using Chain of Thought Reasoning with ChatGPT

In this article we will focus on large language model tasks to process a series of inputs i.e. the tasks that take the input and generate a useful output often through a series of steps - using ChatGPT

Jun 21, 2023

Evaluating Moderation Inputs for Large Language Models

In this article we look at how you evaluate moderation inputs to large language models, which is important when creating LLM applications that involve chains of multiple inputs and outputs to LLMs to ensure that users are behaving responsibly and aren’t trying to exploit the system in any manner

Jun 20, 2023

Evaluating Classification Inputs for Large Language Models

Here we look at how you evaluate classiciation inputs to large language models, which is important when creating LLM applications that involve chains of multiple inputs and outputs to LLMs

Jun 19, 2023

An overview Language Models, the Chat format and Tokens

Here we give a brief overview of how LLM’s work, how they are trained, what is a tokeniser and how a choice of different tokenisers can effect the output of the LLM. We will also look at what the ‘chat format’ for LLM’s is all about

Jun 18, 2023

Key Considerations when Creating Practical Applications using Large Language Models

Creating useful applications with AI & Large Language Models involves many aspects, here I highlight key considerations when building these applications & describe how I built & deployed 6 LLM applications with LangChain to summarise or chat with documents, web pages or youtube videos

Jun 14, 2023

Creating LLM based Agents using LangChain

In this project we will use LangChain to create LLM based agents which can help answer questions, reason through content or even to decide what to do next based on various information sources or tools you can give it access to

Jun 6, 2023

Using LangChain to Evaluate LLM Applications

In this article we look at how LangChain can help evaluate LLM performance for a specific Application

Jun 5, 2023

Question and Answering for Documents using LangChain

In this article we look at how LangChain can perform question answering over documents using embeddings and vector stores.

Jun 4, 2023

Using Chains with LangChain

Here we will look at the Chains component of LangChain and see how this can help us combine different sequences of events using LLM’s.

Jun 3, 2023

Using LangChain Memory for LLM Applications

Here we look at how LangChain can give useful memory to improve LLM model responses.

Jun 2, 2023

Using LangChain for LLM Application Development

LangChain is an intuitive open-source python framework created to simplify the development of useful applications using LLMs. In this article we introduce the framwwork then look at the Models, Prompts and Parsers components of LangChain.

Jun 1, 2023

Using ChatGPT to Create a Customised Chatbot

In this project we will use ChatGPT to utilize its chat format to have extended conversations with chatbots personalized or specialized for specific tasks or behaviors.

May 7, 2023

Expanding & Customising Text using Large Language Models

We will use ChatGPT to generate customer service emails that are tailored to each customer’s review.

May 6, 2023

Large Language Models for Text Transformation

In this article we will explore how to use Large Language Models for text transformation tasks such as language translation, spelling and grammar checking, tone adjustment, and format conversion.

May 5, 2023

Inferring with Text Prompts for Large Language Models

Here we look at how to use Large Language Models such as ChatGPT to infer sentiment and topics from product reviews and news articles

May 4, 2023

Creating Prompts to Summarise Text with Large Language Models

In this article we look at how to use Large Language Models such as ChatGPT to summarize text with a focus on specific topics

May 3, 2023

Iterative Prompt Development for Large Language Models

Here we look at how to develop prompts for large language models iteratively

May 2, 2023

Best Practice for Prompting Large Language Models to Generate Good Output

In this article we look at two prompting principles and their related tactics in order to write effective prompts for large language models.

May 1, 2023

Fine-tuning a Sentiment Analysis Model with Hugging Face

In this project we fine-tune a pre-trained model for sentiment analysis model using Hugging Face

Apr 23, 2023

Fine-tuning a Text Similarity model with Hugging Face - Fine Tune the Model

In this article we will look in a bit more detail at what you might need to do to fine-tune a pre-trained model for text similarity using Hugging Face

Apr 2, 2023

Fine-tuning a Text Similarity model with Hugging Face - Dataset Preparation

In this article we will look in a bit more detail at what you might need to do to prepare your data for fine-tuning a pre-trained model for text similarity using Hugging Face

Apr 1, 2023

An Introduction to the Transformer Model - The power behind recent advances in AI

In this non-technical article we describe the basics of how transfomer models work which is the underlying technology behind Chat-GPT and most of the recent advances in AI

Mar 29, 2023

Using an efficient transformer to create an interactive and more complex chatbot

Here we are going to use the Reformer aka the efficient Transformer to create a more advanced conversational chatbot. It will learn how to understand context to better answer questions and it will also know how to ask questions if it needs more info, which could be useful for customer service applications.

Mar 28, 2023

Reversable residual networks for more efficient transfomer models

In this post we will explore Reversible Residual Networks and see how they can be used to improve Transfomer models

Mar 27, 2023

Making more efficient attention for transformers with reversable layers and Locality Sensitive Hashing (LSH)

Here we look at how to make transfomers more efficient using Reversible Layers and Locality Sensitive Hashing (LSH)

Mar 26, 2023

Customising a Chatbot with Fine Tuning and Hugging Face Pretrained Models

In this article, we will fine-tune a model using Hugging Face transformers to create a better chat bot for question answering

Mar 25, 2023

Creating a Chatbot with Hugging Face Pretrained Models

We will use Hugging Face transformers to download and use the DistilBERT model to create a chat bot for question answering

Mar 24, 2023

Implementing the T5 text transformer model

We implement the Text to Text Transfer from Transformers model (better known as T5) which can perform a wide variety of NLP tasks and is a versatile model.

Mar 22, 2023

Creating a Transformer Model for Text Summarisation

Text summarization is an important task in natural language processing. In this article we will create a transfomer decoder model to perform text summarization.

Mar 18, 2023

Implementing GPT-2 A Transfomer Decoder NLP Model

In this article we’ll explore the transformer decoder which is the architecture behind GPT-2 and see how to implement it with trax.

Mar 11, 2023

3 Types of Attention for Transfomer based NLP Models

In this article we explore the three ways of attention (encoder-decoder attention, causal attention, and bi-directional self attention) used in transformer NLP models and introducted in the 2017 paper Attention is all you need and see how to implement the latter two with dot product attention.

Mar 4, 2023

Improving seq2seq Language Models using Scaled Dot-Product Attention

The 2017 paper Attention Is All You Need introduced the Transformer model and scaled dot-product attention, sometimes also called QKV (Queries, Keys, Values) attention. In this article we’ll implement a simplified version of scaled dot-product attention and replicate word alignment between English and French, as shown in the earlier paper Bhadanau, et al. (2014).

Mar 2, 2023

Improving seq2seq Language Models using Basic Attention

The attention mechanism is behind some of the recent advances in deep learning using the Transfomer model architecture. In this article we look at the first attention mechanism proposed in a paper by Bhadanau et al (2014) used to improve seq2seq models for language translation.

Mar 1, 2023

Custom Models and human-in-the-loop pipelines with AWS Augmented AI (A2I)

In this project we will create our own human workforce, a human task UI, and then define the human review workflow to perform data labeling for an ML task.

Feb 24, 2023

Advanced Model Deployment on AWS - A/B testing traffic shifting and autoscaling

AWS Sagemaker offers many options for deploying models, in this project we will create an endpoint for a text classification model, splitting the traffic between them. Then after testing and reviewing the endpoint performance metrics, we will shift the traffic to one variant and configure it to autoscale.

Feb 22, 2023

Optimize Models in the Cloud using AWS Automatic Model Tuning

When training ML models, hyperparameter tuning is a step taken to find the best performing training model. In this article we will apply a random algorithm of Automated Hyperparameter Tuning to train a BERT-based natural language processing (NLP) classifier. The model analyzes customer feedback and classifies the messages into positive, neutral, and negative sentiments.

Feb 14, 2023

Building an AWS SageMaker Pipeline for a BERT Based text classifier

In this project we train and deploy a BERT Based text classifier using AWS Sagemaker pipelines, and describe how this can help with MLOPS to provide the most efficient path to production for training deploying and maintaining machine learning models at scale in production.

Feb 12, 2023

Train a Review Classifier with BERT and Amazon SageMaker

We train a text classifier using a variant of the BERT deep learning model architecture called RoBERTa - a Robustly Optimized BERT Pretraining Approach, within a PyTorch model ran as a SageMaker Training Job.

Feb 11, 2023

Feature Transformation with Amazon SageMaker Processing Job and Feature Store

We will prepare to train a BERT-based natural language processing (NLP) model converting review text into machine-readable features used by BERT. With the required feature transformation we will configure an Amazon SageMaker processing job to perform the task.

Feb 8, 2023

Creating a Sentiment Analysis Text Classification Model using AWS SageMaker BlazingText

In this article we will use the AWS SageMaker BlazingText built-in deep learning model to predict the sentiment for customer text reviews. BlazingText is a variant of FastText which is based on word2vec.

Feb 6, 2023

Train a model quickly with Amazon SageMaker Autopilot

We will use Amazon Sagemaker Autopilot to automatically train a natural language processing (NLP) model. The model will analyze customer feedback and classify the messages into positive (1), neutral (0) and negative (-1) sentiment.

Feb 5, 2023

Detect data bias with Amazon SageMaker Clarify

In Data Science and machine learning, bias can be present in data before any model training occurs. In this article we will analyze bias on a dataset, generate and analyze bias reports, and prepare the dataset for the model training.

Feb 4, 2023

Loading & Transforming Clothing Reviews Text Data with AWS

In this project we will explore text reviews for clothing products using tools from the cloud data science service AWS Sagemaker to load and visualise the data and to gain key insights from it.

Feb 3, 2023

Using Satellite Images and Deep Learning to Track Deforestation in the Amazon

In this project we will be using a deep learning model to classify satellite images of the amazon rain forest. Here the main objective is not to get the best results for this task, rather to use this dataset to illustrate the use of the Fastai deep learning library

Jan 15, 2023

NLP and Text Classification Without Deep Learning for Business Applications

Deep Learning and AI is powering some of the most recent amazing advances in text & natural language processing (NLP) applications, such as GPT-3, Chat-GPT and Dall-E but these often require specialist resources such as deep learning. With Machine Learning (ML) its possible to create useful NLP applications for businesses without using AI and Deep Learning.

Jan 8, 2023

From Machine Learning to Deep Learning From Scratch

What’s the difference between machine learning and deep learning? In this article we will explain the differences between machine learning & deep learning, and will illustrate this by building a machine learning and a deep learning model from scratch.

Dec 17, 2022

US Patent Phrase to Phrase Matching

In this project I will create a model that can associate short text phrases with the correct US patent classification.

Dec 10, 2022

Using AI to Identify Galaxies

This article covers lesson 1 the fastai 2022 course where I will create a model that can identify different types of galaxies. I will also highlight some notable differences from earlier versions of the fastai course and library.

Dec 5, 2022

Predicting 10 Year Death Risk from Health Data

In this project we will build a model to predict the 10-year risk of death of individuals from the NHANES I epidemiology dataset

Aug 6, 2022

A Prognostic Risk Score Model for Retinopathy in Diabetes Patients

In this project we will build a Prognostic risk score model for retinopathy in diabetes patients using logistic regression

Jun 11, 2022

Evaluating Healthcare Diagnostic Models

In this project we will be working with the results of the X-ray classification model for diseases we developed in the previous article, and evaluate the model performance on each of these classes using various classification metrics.

May 22, 2022

Medical Diagnosis of 14 Diseases Using Chest X-Rays

In this project, I will explore medical image diagnosis by building a state-of-the-art deep learning chest X-ray classifier using Keras that can classify 14 different medical conditions.

May 15, 2022

The International Classification of Disease System (ICD)

In this article we will look at the history of the International Classification of Diseases (ICD) system, which has been developed collaboratively so that the medical terms and information in death certificates can be grouped together for statistical purposes. In practical examples we will look at how to extract ICD-9 codes from MIMIC III database and visualise them.

Mar 18, 2022

MIMIC-III (EHR) Clinical Outcomes & Patient Level Data

In this article we will further explore the MIMIC-III critical care Electronic Health Record Dataset, looking at how we examine clinical outcomes as well as extracting indivdual patient data.

Mar 15, 2022

MIMIC-III (EHR) for Descriptive Health Analytics

In this article we will look at the MIMIC-III Electronic Health Record (EHR) database. In particular, we will learn about the design of this relational database, and what tools are available to query, extract and visualise descriptive analytics.

Mar 14, 2022

The MIMIC-III Electronic Health Record (EHR) database

In this article we will look at MIMIC-III, which is the largest publicly Electronic Health Record (EHR) database available to benchmark machine learning algorithms.

Mar 14, 2022

Validity and Bias in Epidemiology

Epidemiological studies can provide valuable insights about a disease, however a study can yield biased results for many different reasons. In this article we explore some of these factors, and provides guidance on how to deal with bias in epidemiological research.

Mar 6, 2022

Study Designs in Epidemiology

In this article, we will learn about the main epidemiological study designs, including cross-sectional and ecological studies, case-control and cohort studies, as well as the more complex nested case-control, case-cohort designs, and randomised controlled trials.

Mar 4, 2022

Measuring Disease in Epidemiology

In this article we look at the fundamental tools of Epidemiology (the study of disease) essential to conduct studies such as measures to describe the frequency of disease, how to quantify the strength of an association, how to describe different strategies for prevention, how to identify strengths and weaknesses of diagnostic tests, and when a screening programme may be appropriate.

Feb 22, 2022

Predicting Alzheimers disease using 3D MRI medical images

In this project I develop a deep learning CNN model to predict Alzheimer’s disease using 3D MRI medical images of the Hippocampus region of the brain.

Feb 6, 2022

Patient Selection for Diabetes Drug Testing

Utilizing a synthetic Diabetes patient dataset, we will create a deep learning model trained on EHR data (Electronic Health Records) to find suitable patients for testing a new Diabetes drug.

Feb 6, 2022

Pneumonia Detection From Chest X-Rays

In this project, I will analyze data from the NIH Chest X-ray 2D Medical image dataset and train a deep learning model to classify a given chest x-ray for the presence or absence of pneumonia.

Feb 6, 2022

Python Power Tools for Data Science - Pycaret Anomaly Detection

In Python Power Tools for Data Science articles I look at python tools that help automate or simplify common tasks a Data Scientist would need to perform. In this article I look at the Pycaret Anomaly Detection module and see how this can help automate this process.

Jan 2, 2022