What is a Vector Database?
Structured, Semi-Structured, & Unstructured Data
Data professionals often categorize information into three broad types: structured, semi-structured, and unstructured.
Structured data fits neatly into tables. Think of the file browser on your computer: each row is a file, and each column is a property — filename, file type, size, date created, date modified. You can sort by any of these columns with a click: A–Z, largest to smallest, newest to oldest.
Example: A relational database (SQLite) — predictable columns, predictable queries. (Interactive: toggle table columns on/off to see NULL values appear)
Semi-structured data starts to bend the rules. Not every entry has the same set of properties, so rigid columns start filling up with “NULL” (empty) values. For example:
A text file might have a word count.
An image might have a resolution and GPS location.
Both still have common fields like name, size, and date.
You could make a giant table with a column for every possible property, but most cells would be empty. Object-based databases (like MongoDB) solve this by letting each entry store only the properties it needs — no wasted columns.
Try It Out! Toggle table columns on and off to see how structured data stays consistent while semi-structured data fills with empty values (NULLs).