Applying AI techniques from drug discovery to LLMs to reduce hallucinations

December 5, 2024

Revolutionary GitHub projects: Automated drug discovery with AI

The integration of artificial intelligence (AI) into drug discovery is revolutionizing the pharmaceutical industry. Open source projects on GitHub play a crucial role in this. Below we present some of the most innovative projects that are driving automated drug discovery using AI.

DeepChem: Open Platform for Deep Learning in Chemistry

DeepChem is a leading open source library that makes deep learning accessible for chemical applications. It provides tools for:

Advertising

Molecular modeling
Protein structure prediction
Materials science

Through its user-friendly interface, DeepChem enables researchers to implement complex AI models without in-depth programming knowledge. This accelerates the discovery of new drugs and promotes innovation in the industry.

MoleculeNet: Benchmarking for AI in Chemistry

MoleculeNet is a comprehensive benchmarking system specifically designed for machine learning in chemical research. It offers:

Standardized datasets
Evaluation metrics
Comparison of model performance

By providing consistent benchmarks, MoleculeNet facilitates the comparison of different AI models and thus promotes progress in drug discovery.

ATOM Modeling PipeLine (AMPL): Accelerated drug discovery

The ATOM Modeling PipeLine is a project of the ATOM consortium that aims to accelerate drug development using machine learning. AMPL offers:

Modular pipeline for data preparation
Automated model training
Extensible frameworks for different use cases

With AMPL, researchers can efficiently build complex models and thus shorten the time from discovery to market launch of new drugs.

Chemprop: Molecular property prediction with deep learning

Chemprop uses graphical neural networks to predict molecular properties. Its features include:

High prediction accuracy
Customizable model architectures
Support for various chemical datasets

Chemprop has achieved outstanding results in several competitions and is a valuable tool for AI-assisted chemistry.

DeepPurpose: Universal Toolkit for Drug Discovery

DeepPurpose is a comprehensive deep learning toolkit for drug discovery. It offers:

Integration of different models and datasets
Easy implementation of predictive models
Applications in protein-ligand interactions

Through its versatility, DeepPurpose enables researchers to quickly and efficiently identify new therapeutic candidates.

OpenChem: Special deep learning framework for chemical applications

OpenChem is a deep learning framework tailored to chemistry. It is characterized by:

Support for molecule generation
Property prediction
Flexibility in model design

OpenChem promotes the development of new methods in chemical AI and helps accelerate research.

The open source community on GitHub is pushing the boundaries of automated drug discovery with these projects. Combining AI and chemistry opens up new opportunities to develop therapeutic solutions more efficiently and precisely. These innovations have the potential to change the future of medicine for the long term.

Application of AI research models from drug research to the distillation of AI models

TheThe AI models and methods used offer innovative approaches that can be transferred to the distillation of AI models. Although the two fields appear different at first glance, they share common techniques and challenges that make them useful.

Use of Application

Applying research models from drug discovery to AI model distillation makes sense because:

Common Methods: Both fields use advanced machine learning techniques such as deep learning, neural networks and graph-based models.
Complexity Reduction: In drug discovery, complex molecular structures are represented in a simplified manner, similar to the reduction of large AI models into more compact forms.
Optimization and Efficiency: Both drug discovery and model distillation aim to achieve efficient and powerful results with limited resources. achieve.

How it can be applied

1. Graph Neural Networks (GNNs) for structural understanding

In drug research, Graph Neural Networks are used to analyze molecular structures. These techniques can be used in model distillation to understand the structure of large models and extract essential features for the smaller model.

2. Transfer Learning and Feature Extraction

The models from projects such as DeepChem or Chemprop use transfer learning to learn from existing data sets. Similarly, in distillation, a large pre-trained model can serve as a starting point from which essential features are transferred to the smaller model.

3. Multi-task learning for versatile models

Projects such as MoleculeNet use multi-task learning to train models that can handle multiple tasks simultaneously. This method can be used in distillation to create compact models that still perform versatile functions.

4. Optimization techniques from drug discovery

Optimization approaches from drug discovery, such as fine-tuning hyperparameters or using evolutionary algorithms, can be applied to make distilled models more efficient.

5. Data augmentation and generation

Generating synthetic data is key in projects like DeepPurpose. Similar techniques can be used to improve the training process of the student model in distillation, especially when limited data is available.

Practical implementation steps

Analysis of model structure: Using GNNs to identify important components of the teacher model.
Feature selection: Extracting critical features that are crucial for the model's performance.
Efficient architecture designs: Adapting model architectures from drug discovery for more compact model structures.
Joint training: Implementing multi-task learning to train the student model on multiple tasks to improve generalization ability. increase.

The integration of methods from automated drug discovery into the distillation of AI models opens up new ways to increase efficiency and reduce complexity. By transferring proven techniques, powerful, compact models can be developed that meet the requirements of modern AI applications. This interdisciplinary approach promotes innovation and accelerates progress in both research fields.

Extension: Application of AI techniques from drug discovery to LLMs to reduce hallucinations

Advances in artificial intelligence have revolutionized both drug discovery and the development of Large Language Models (LLMs). An interesting question is whether the techniques from automated drug discovery can help to increase the prediction accuracy of LLMs and reduce hallucinations. In the following, we explore this possibility and analyze whether such an application makes sense and whether these techniques are already used in LLMs.

Connection between AI-Techtechniques in chemistry and LLMs

1. Graph Neural Networks (GNNs) and structural analysis

In drug research, Graph Neural Networks are used to understand and predict the complex structures of molecules. GNNs model data as graphs, which is natural in chemistry since molecules are made up of atoms (nodes) and bonds (edges).

Application to LLMs:

Syntax trees as graphs: Similar to molecules, sentences can be represented as graphs, where words are nodes and grammatical relations are edges.
Improved context modeling: GNNs could be used to better model the relationships between words in a sentence, which could improve contextualization in LLMs.

2. Fuzziness and uncertainty estimation

In drug discovery, uncertainty estimation is crucial to assess the reliability of predictions.

Application to LLMs:

Reducing hallucinations: By incorporating uncertainty estimates, LLMs could better evaluate their own predictions and be less inclined to provide incorrect or hallucinated information.
Confidence metrics: Implementing metrics that indicate how confident the model is in its answer.

3. Multi-task learning and transfer learning

Projects like MoleculeNet use multi-task learning to train models that predict multiple properties simultaneously.

Application to LLMs:

Simultaneous optimization of multiple goals: LLMs could be trained to optimize both next word prediction and content correctness.
Transfer of domain knowledge: Transfer learning allows models to use specific expertise from chemistry to make more precise statements in that domain.

4. Data augmentation and synthetic data generation

In chemistry, synthetic data is used to improve models, especially when real-world data is limited.

Application to LLMs:

Expanding training datasets: generating additional, high-quality text data to improve the training process.
Improving generalization ability: More diverse data allows the model to generalize better and hallucinate less.

Does the application make sense?

Transferring techniques from AI-assisted drug discovery to LLMs theoretically makes sense, as both fields use complex data structures and machine learning. Some reasons are:

Common mathematical foundations: Both fields use neural networks and optimization methods.
Need for accuracy and reliability: In both medicine and information processing, precise predictions are crucial.

Challenges

Different data types: Chemical data is structurally different from natural language.
Scalability: LLMs are often significantly larger and more complex than models in chemistry, which makes direct application difficult.

Are these techniques already used in LLMs?

Many of the techniques mentioned are already used in LLMs in some form integrated:

Uncertainty estimation: Some models use Bayesian approaches or Monte Carlo dropout to model uncertainty.
Graph-based models: While GNNs are not used directly in LLMs, there are models that consider syntax trees or dependency graphs.
Multi-task and transfer learning: LLMs like GPT-4 use transfer learning and can be fine-tuned for multiple tasks.

Potential innovative Approaches

Despite the existing techniques, there is potential for new approaches:

Hybrid models: Combination of LLMs with GNNs for better context modeling.
Chemistry-inspired optimization: Use of optimization methods from chemistry to improveing the training procedures of LLMs.
Interdisciplinary datasets: Incorporating data from chemistry to make LLMs more accurate in specialized areas.

Applying techniques from automated drug discovery to LLMs offers exciting opportunities to improve prediction accuracy and reduce hallucinations. While some methods are already used in LLMs, there is room for further innovation through an interdisciplinary approach. The challenges lie mainly in the different data types and scalability. Nevertheless, collaboration between these two fields could lead to significant advances in AI research.

Short thought experiment: Does it make sense?

Chemistry and natural language are different at first glance, but both are systems with complex rules and structures. The techniques for modeling and prediction in chemistry could therefore provide valuable input for natural language processing. It is important to be open to interdisciplinary approaches, as innovation often arises at the interfaces of different disciplines.

Integrating AI techniques from drug discovery into the development of LLMs could be a promising way to further increase the performance of these models. By learning from each other, both areas can benefit from each other and together open up new horizons in AI research.

Implementation to reduce hallucinations in LLMs using Hugging Face

Below, we show how to create a language model with uncertainty estimation using Hugging Face and Python to reduce hallucinations. We use techniques inspired by methods used in automated drug discovery, in particular uncertainty estimation by Monte Carlo dropout.

Requirements

Python 3.6 or higher
Installed libraries:
- transformers
- torch
- datasets

You can install the required libraries using the following command:

pip install transformers torch datasets

Code implementation

import torch
from transformers import AutoTokenizer, AutoModelForCausalLM
import torch.nn.functional as F
import numpy as np

# Loading the tokenizer and model
model_name = 'gpt2'
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

# Enabling dropout in evaluation mode too
def enable_dropout(model):
""Enables dropout layers in the model during evaluation."""
for module in model.modules():
if isinstance(module, torch.nn.Dropout):
module.train()

# Function for generating with uncertainty estimation
def generate_with_uncertainty(model, tokenizer, prompt, num_samples=5, max_length=50):
model.eval()
enable_dropout(model)
inputs = tokenizer(prompt, return_tensors='pt')
input_ids = inputs['input_ids']

# Multiple predictions for uncertainty estimation
outputs = []
for _ in range(num_samples):
with torch.no_grad():
output = model.generate(
input_ids=input_ids,
max_length=max_length,
do_sample=True,
top_k=50,
top_p=0.95
)
outputs.append(output)

# Decoding the generated sequences
sequences = [tokenizer.decode(output[0], skip_special_tokens=True) for output in outputs]

# Calculating the uncertainty (entropy)
probs = [] 
for output in outputs:
 with torch.no_grad():
 logits = model(output)['logits']
 prob = F.softmax(logits, dim=-1)
 probs.append(prob.cpu().numpy())

 # Calculate average entropy
 entropies = []
 for prob in probs:
 entropy = -np.sum(prob * np.log(prob + 1e-8)) / prob.size
 entropies.append(entropy)

 avg_entropy = np.mean(entropies)
 uncertainty = avg_entropy

 # Selection of the most frequently occurring sequence
 from collections import Counter
 sequence_counts = Counter(sequences)
 most_common_sequence = sequence_counts.most_common(1)[0][0]

 return {
'generated_text': most_common_sequence,
'uncertainty': uncertainty
}

# Example usage
prompt = "The impact of artificial intelligence on medicine is"

result = generate_with_uncertainty(model, tokenizer, prompt)
print("Generated text:")
print(result['generated_text'])
print("nEstimated uncertainty:", result['uncertainty'])

Code explanation

Loading model and tokenizer: We use the pre-trained GPT-2 model from Hugging Face.

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

Enable dropout: We use the enable_dropout function to enable the dropout layers during evaluation to enable Monte Carlo dropout.
```
def enable_dropout(model):
for module in model.modules():
if isinstance(module, torch.nn.Dropout):
module.train()
```
Generation with uncertainty estimation: The generate_with_uncertainty function performs multiple predictions and calculates the uncertainty based on the entropy of the output distributions.
```
def generate_with_uncertainty(model, tokenizer, prompt, num_samples=5, max_length=50):
# Function implemented as above shown
```
Uncertainty calculation: The entropy of the probability distributions is calculated to estimate the uncertainty. A higher entropy indicates a higher uncertainty.
Selecting the best sequence: We choose the most frequently generated sequence as the final output because it is most likely to be correct.

Using GitHub repositories

For extended functionality and advanced methods, the following GitHub repositories may be useful:

Bayesian Transformer Networks: Bayesian Transformers
- Implementing Transformers with Bayesian methods for uncertainty estimation.
Knowledge-Augmented Language Models: K-Adapter
- An approach to integrating factual knowledge into language models to reduce hallucinations.

Extension options

Fine-tuning with domain-specific data: By fine-tuning the model with specific data sets, the accuracy can be increased.

from datasets import load_dataset

# Loading a domain-specific dataset
dataset = load_dataset('your_dataset')

# Insert fine-tuning code here

Integration of knowledge graphs: Integration of external knowledge databases such as Wikidata to validate and supplement the generated content.
Use of larger models: Use of more advanced models such as GPT-3 or GPT-4 via corresponding APIs for better results.

Conclusion

By applying uncertainty estimates and techniques from automated drug discovery, we can Increase reliability of language models and reduce unwanted hallucinations. The provided implementation serves as a starting point and can be further developed to meet specific requirements.

Note: The implementation shown above is a simplified example. In a production environment, other aspects such as efficiency, scalability and ethical considerations should be taken into account.

Author: Thomas Poschadel

Transfer Chemical learning to LLMs

🐾 Cats, cuisine, and culture shock? A culinary-critical journey through Vietnam – with a touch of philosophy ..read more..

Pandemic Outbreak in the Post-Quantum Era Joke, Kawały, Dowcipy, FICTION ..read more..

The Sick Horde / The Beginning of Intelligence h4h4 Why you better not go out ..read more..

Overwhelming Mentally Ill People Due to Therapeutic Demands: Psychological and Legal Consequences Using Single Mothers as an Example Abstract..read more..

Make jokes about: How to Build a Ghosttown: WITZ,Joke,Kawały,Dowcipy Haha, that has some really absurd ..read more..

Shielding Hex-Pattern: Advances in Multimodal Force Field Protection Using EM, Sonic, Ion, and Proton Technologies Th..read more..

Extinguishing the Sun with Ceramics - A Theoretical Consideration of Ceramic Cooling Mechanisms in Stellar Processes 04-..read more..

The Strategic Imperative of AI-Compute Power in Y3k: Decompression, Quantum Paradigms, and the New Cyberwar Landscape ..read more..

Cyborg Jokes from the Time Rider WITZ,Joke,Kawały,Dowcipy ..read more..

The polymorphic children's play dough of the post-quantum era April 25, 2025 ..read more..

Quantum Entanglement and Attention Detection: Experimental Approaches for Detecting Observation by Quantum Objects 04-23..read more..

Nikola Test - Invention of the Radio WITZ,Joke,Kawały,Dowcipy Dude, you know. It didn't sound. It didn'..read more..

The Fight for Solaris 2: Lost Survivors without Sun When love becomes a weapon - and the Big Bang the final answer..read more..

The Fight for Solaris - The Last Mirror of Our Humanity A science fiction novel as a reckoning with the future It b..read more..

Mind Transference During Sex – Myth, Magic, or Measurable Reality? Sex is more than just physical union. It's an experi..read more..

Bent Lead Fiber Nanotubes as Novel Radiation Shielding in Space: Potential, Challenges, and Future Perspectives Abstra..read more..

Development and Application of Artificial Cyberbone Replacement Systems Based on an Aluminum-Copper-Moss Alloy with Penetrating Nanochannel Structures..read more..

How the Cosmos Might Just Be a Really Big Soap Bubble Abstract Gravity has always been that quiet, invisible roommate who never pays..read more..

The Tesla Barrel Organ: When the Universe Looks at You WITZ,Joke,Kawały,Dowcipy A..read more..

Arrow 3 System as the Ultimate Asteroid Defense System April 8, 2025 ..read more..

Jump Drive by TJP How to Jump to other Universe 2025-05-04..read more..

Thermodynamic Manipulation of Vacuum States for Wormhole Formation: A Hypothetical Framework Author: Thomas Jan Poschade..read more..

Wind turbines as weather control? How AI can help prevent hurricanes and generate targeted rain Wind turbines not only generate renewable ener..read more..

Solar cells as inductive chargers: How they can be converted to charge electric cars 04.01.2025..read more..

Black Solar Cells and Their Impact on Global Warming Solar cells are considered..read more..

Lost in the Depths of HighOS: WITZ,Joke,Kawały,Dowc..read more..

HighOS: When the Future Becomes Indifference WITZ,Jo..read more..

When the Jaguar Becomes a Squirrel: A Look into the Strange World of Nut-Buried Predators WITZ,Joke,Kawały,Dowcipy..read more..

Love is a complex and multifaceted concept that is difficult to capture in a single definition. It is a strong feeling of affection, connectio..read more..

Now* we’re cooking with satire! 🔥 WITZ,Joke,Kawały,Dowcipy DEEPSEEK..read more..

Here are 10 fictional Predator laws WITZ,Joke,Kawały,Dowcipy * Honor the hunter. ..read more..

ChatGPT: The AI that calculates the universe while you're still thinking about the question WITZ,Joke,Kawały,Dowcipy..read more..

Gemma 2: The AI that calculates the universe while you're still thinking about the question WITZ,Joke,Kawały,Dowcipy..read more..

LLaMA 3.3: The AI that thinks faster than you can google WITZ,Joke,Kawały,Dowcipy There are artificial ..read more..

Here comes a nerd gem made up of **brainfuck code** and **Turing test chaos** – with plenty of tongue-in-cheek humor and retro computing absurdit..read more..

Here comes a portion of satirical political humor about **XXL parliaments** and *salary increases* - of course with a wink and fictitious exaggerations..read more..

Here comes a harmless, bizarre **Matrix-Psychiatry mix** - without any stigma, just with a wink and pop culture chaos. 😄💊 ..read more..

Here comes a satirical **Bitcoin virus joke** with blockchain chaos, forks and a touch of crypto absurdity. 😄🔗 ..read more..

Here comes the dark romantic comedy collab between **HAL 9000** and **Siri** - including roses, sarcasm and AI drama. 🌹🤖💔 ..read more..

Here comes a portion of tongue-in-cheek satire about **rich people** - of course purely fictional, exaggerated and with a love of absurd luxury. 😄�..read more..

Here comes the darkly satirical take on the **ransomware hacker world** WITZ,Joke,Kawały,Dowcipy ..read more..

Here comes the satirical reckoning with the **botched construction** and the mysteriously exploding costs - of course with a lot of tongue-in-cheek and..read more..

"Why did the zero-intelligence aliens fail to invade Earth?* WITZ,Joke,Kawały,Dowcipy ..read more..

Here comes a dark, sarcastic comedy mix of **stalker logic**, **0 IQ romance** and toxic love - of course purely fictional and with eye-rolling humor. ..read more..

Here comes the NATO version of satire - with a wink and a touch of geopolitical absurdity. 😄🌍 WITZ,Joke,Kaw..read more..

Here comes a satirical portion of Bundestag humor with the SPD, CDU and co. - of course in a fun format and without any bad intentions. 😄🇩🇪&nb..read more..

Let’s blend the chaotic "zero intelligence" vibe with the zen art of mandala drawing WI..read more..

Here's a satirical dose of **technology fear** - with a wink at the paranoia that secretly haunts us all. 😱📱 ..read more..

Auto Learn Cluster Software (ALCS) - Steps to Realize Distributed AI Computing over the Internet December 4, 2024..read more..

Future vision of a modern society November 30, 2024 Imagine a world in which people and artificia..read more..

Scenario of a global energy shortage due to exponential global warming and resource scarcity 11/20/2024 T..read more..

Time Reverse Sight 02/20/2024 The Time Reverse Sight describes a technology that uses artificial intellig..read more..

Wooden satellites to defend against aliens 02/21/2024 The concept of defending against aliens using tree ..read more..

February 16, 2024 Is a fusion reactor possible in which uranium serves as fuel, from which hydrogen is extracted a..read more..

Being, becoming, big and the smallest particle, everything ChatGPT Being is a fundamental concept that encom..read more..

Software Brain Computer Interfaces 27.12.2022 Create a code for AI and human interaction by mind and thou..read more..

Intelligent Frubber 27.12.2020 Frubber is skin-like and can be used as a coating for robots or androids...read more..

Nanobots (quantumbots) in the circuit 07.11.2020 WITZ,Joke,K..read more..

Induction into isolated meshes 22.10.2020 How do you penetrate isolated networks that are cut off from th..read more..

Terra Scale Internet through old gas lines 31.08.2020 Gas or natural gas pipelines would make it possible..read more..

Neutrino bombs from the vastness of the quantum planes 28.08.2020 No one knows what a neutrino bomb is. I..read more..

Gold in the nuclear reactor. 25.08.2020 When the first nuclear reactors in the US went online, the power ..read more..

Logical circuits of computers at the quantum level are shrinking. 21.08.2020 Terra Scale Processing requi..read more..

Energy generation from exotic particles. 22.08.2020 With the help of further discoveries from quantum com..read more..

Implanted augmentations 21.08.2020 In short, the intermediate route to the cyborg are improvements such a..read more..

Particle accelerator in silicon chip size. 09.01.2020 Physicists at Stanford University have built a chip..read more..

Fusion reactor as a gravitational generator. 08.01.2020 Researchers have successfully succeeded in operat..read more..

First DNA/ bio-computer that calculates the square root from 900. 05.01.2019 Recently, he has been able t..read more..

Intel and Google & Co. are developing machine programming. 30.12.2019 Developers at Intel are using m..read more..

Data transfer at the quantum level with the help of silicon chips. 28.12.2019 For the first time, researc..read more..