What is a Vector Database?

Structured, Semi-Structured, & Unstructured Data

Data professionals often categorize information into three broad types: structured, semi-structured, and unstructured.

Structured data fits neatly into tables. Think of the file browser on your computer: each row is a file, and each column is a property — filename, file type, size, date created, date modified. You can sort by any of these columns with a click: A–Z, largest to smallest, newest to oldest.

Example: A relational database (SQLite) — predictable columns, predictable queries. (Interactive: toggle table columns on/off to see NULL values appear)

Semi-structured data starts to bend the rules. Not every entry has the same set of properties, so rigid columns start filling up with “NULL” (empty) values. For example:

You could make a giant table with a column for every possible property, but most cells would be empty. Object-based databases (like MongoDB) solve this by letting each entry store only the properties it needs — no wasted columns.

Try It Out! Toggle table columns on and off to see how structured data stays consistent while semi-structured data fills with empty values (NULLs).