How to Work with String Data in Data.olllo AI Chat
With Data.olllo AI Chat, you can clean, standardize, and extract key information from messy text data — without memorizing complex syntax or regex. Just describe what you need, and the AI will generate a ready-to-run process(dfs) function for you.

Screenshot example: Enter your requirement → Data.olllo writes and runs the correct code
1. Convert Text to Uppercase
Standardize text by making all letters uppercase:
def process(dfs):
df = dfs["df"]
df["customer_name"] = df["customer_name"].str.upper()
return df2. Extract Mobile Numbers from Text
Pull valid phone numbers from unstructured text using regex:
def process(dfs):
import re
df = dfs["df"]
df["mobile"] = df["notes"].str.extract(r'(\+?\d[\d\s-]{7,}\d)')
return df3. Extract Email Addresses
Isolate emails from messy contact info:
def process(dfs):
df = dfs["df"]
df["email"] = df["contact_info"].str.extract(r'([\w\.-]+@[\w\.-]+)')
return df4. Remove Special Characters
Keep only letters, numbers, and spaces:
def process(dfs):
df = dfs["df"]
df["clean_text"] = df["raw_text"].str.replace(r'[^A-Za-z0-9 ]+', '', regex=True)
return df5. Split and Combine Strings
Split full names into first and last names, or merge columns:
def process(dfs):
df = dfs["df"]
name_split = df["full_name"].str.split(" ", n=1, expand=True)
df["first_name"] = name_split[0]
df["last_name"] = name_split[1]
df["full_name_merged"] = df["first_name"] + " " + df["last_name"]
return df💡 Pro Tips
- You can combine multiple cleaning steps into a single AI prompt.
- Always preview extracted data — regex patterns can sometimes capture unintended matches.
- For complex extractions, specify examples in your prompt for better accuracy.