Practical Agentic Coding with Google Jules
If you have an interest in agentic coding, there’s a pretty good chance you’ve heard of Google Jules.
This article is divided into two parts; they are:
• What Is Perplexity and How to Compute It
• Evaluate the Perplexity of a Language Model with HellaSwag Dataset
Perplexity is a measure of how well a language model predicts a sample of text.
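As a quick reminder of the standard definition (the article’s exact evaluation recipe on HellaSwag may differ), for a tokenized sample $x_1, \dots, x_N$ perplexity is the exponentiated average negative log-likelihood the model assigns to the tokens:
$$
\mathrm{PPL}(x_{1:N}) = \exp\left( -\frac{1}{N} \sum_{i=1}^{N} \log p(x_i \mid x_{<i}) \right)
$$
Lower perplexity means the model assigns higher probability to the observed text.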
If you spend any time working with real-world data, you quickly realize that not everything comes in neat, clean numbers.
This article is divided into three parts; they are:
• Training a Tokenizer with Special Tokens
• Preparing the Training Data
• Running the Pretraining
The model architecture you will use is the same as the one created in the …
This article is divided into two parts; they are:
• Simple RoPE
• RoPE for Long Context Length
Compared to the sinusoidal position embeddings in the original Transformer paper, RoPE mutates the input tensor using a rotation matrix:
$$
\begin{aligned}
X_{n,i} &= X_{n,i} \cos(n\theta_i) - X_{n,\frac{d}{2}+i} \sin(n\theta_i) \\
X_{n,\frac{d}{2}+i} &= X_{n,i} \sin(n\theta_i) + X_{n,\frac{d}{2}+i} \cos(n\theta_i)
\end{aligned}
$$
…
Read more “Rotary Position Embeddings for Long Context Length”
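To make that rotation concrete, here is a minimal NumPy sketch of the update above (an illustration only, not the article’s code; the function name `rope`, the input shape `(seq_len, d)`, and the base 10000 are assumptions):

```python
import numpy as np

def rope(X: np.ndarray, base: float = 10000.0) -> np.ndarray:
    """Apply rotary position embeddings to X of shape (seq_len, d)."""
    seq_len, d = X.shape
    half = d // 2
    # One rotation frequency per (i, d/2 + i) pair: theta_i = base^(-2i/d)
    theta = base ** (-2.0 * np.arange(half) / d)       # shape (half,)
    angles = np.arange(seq_len)[:, None] * theta       # entry [n, i] = n * theta_i
    cos, sin = np.cos(angles), np.sin(angles)
    x1, x2 = X[:, :half], X[:, half:]
    # Rotate each (x1_i, x2_i) pair exactly as in the two equations above
    return np.concatenate([x1 * cos - x2 * sin,
                           x1 * sin + x2 * cos], axis=-1)
```

Each pair of dimensions is rotated by an angle proportional to the token position, which is what lets relative offsets between tokens show up as phase differences.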
Agentic coding only feels “smart” when it ships correct diffs, passes tests, and leaves a paper trail you can trust.
Large language models (LLMs) like Mistral 7B and Llama 3 8B have shaken up the AI field, but their general-purpose nature limits how well they serve specialized domains.
Building AI applications often requires searching through millions of documents, finding similar items in massive catalogs, or retrieving relevant context for your LLM.
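The core operation behind that retrieval step can be sketched as a brute-force cosine-similarity search (a toy illustration with placeholder random vectors, not the article’s approach; production systems typically use an approximate-nearest-neighbor index such as FAISS):

```python
import numpy as np

def top_k_similar(query: np.ndarray, embeddings: np.ndarray, k: int = 5) -> np.ndarray:
    """Return indices of the k embeddings most similar to the query (cosine similarity)."""
    q = query / np.linalg.norm(query)
    docs = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    scores = docs @ q                      # cosine similarity against every document
    return np.argsort(-scores)[:k]         # indices of the k highest scores

# Toy usage with random placeholder vectors standing in for real embeddings
rng = np.random.default_rng(0)
doc_vectors = rng.normal(size=(1000, 384))           # e.g., 1,000 document embeddings
top_ids = top_k_similar(rng.normal(size=384), doc_vectors, k=3)
```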
From daily weather measurements and traffic sensor readings to stock prices, time series data are nearly everywhere.
Building machine learning models that work is a relatively straightforward endeavor, thanks to mature frameworks and accessible computing power.