DAZL Documentation | Data Analytics A-to-Z Processing Language

Understanding nollejBase

general

slug: tutorial-understanding-nollejbase

What Is nollejBase?

nollejBase is your virtual data warehouse — but unlike traditional warehouses, it doesn’t store raw data. Instead, it captures knowledge about your data: metadata, summaries, metrics, and analytical results.

It works seamlessly with DAZL: DAZL pipelines process data, and nollejBase stores the insights, making it easy to reuse, analyze, and visualize your results without moving all your raw data.

Why nollejBase Matters

Traditional data warehouses require you to:

Extract data from multiple sources
Transform and load everything into a central database
Maintain heavy infrastructure for storage and queries

With nollejBase:

Your raw data stays where it lives
Only insights and summaries are stored
Analytical workflows are lightweight and flexible

The result: all the power of a warehouse without the overhead.

Core Concepts

1. Datasets

A dataset in nollejBase represents a collection of data or a virtual table.

Can be external (in a database, CSV, API, etc.)
Or internal (generated via DAZL pipelines)

Datasets can be referenced by name in DAZL scripts for transformation or analysis.

2. Metadata

nollejBase automatically stores metadata about each dataset:

Column names and types
Dimensions vs measures
Relationships between tables
Analytical properties like frequency counts, summaries, or cubes

Metadata allows DAZL to generate insights without reprocessing raw data each time.

3. Summaries & Results

When DAZL pipelines are executed, nollejBase stores the outputs:

Frequency tables
Aggregated metrics
Cubes and cross-tabs
Charts and dashboards

These outputs are ready to use in subsequent scripts, dashboards, or reports.

4. Versioning and Reuse

nollejBase supports tracking multiple versions of datasets and analytical results:

Keep snapshots of analyses over time
Reuse prior results in new pipelines
Combine insights across datasets without reprocessing raw data

Example: Storing a DAZL Pipeline Output

Here’s a simple example of a DAZL pipeline writing to nollejBase:

steps:
  - loadInline:
      data:
        - {region: "North", product: "Widget", price: 10, quantity: 5}
        - {region: "South", product: "Gadget", price: 15, quantity: 2}

  - calculate:
      assign:
        total: "$price * $quantity"

  - freq:
      columns: [region]

  - store:
      dataset: "sales_summary_by_region"

Explanation:

loadInline loads the sample dataset
calculate adds a new total column
freq counts rows per region
store saves the resulting summary in nollejBase as a dataset called "sales_summary_by_region"

This dataset is now ready for dashboards, reports, or further analysis.

How DAZL and nollejBase Work Together

DAZL handles the logic: loading, transforming, analyzing, and visualizing.
nollejBase provides the memory: storing results, summaries, and metadata.
Together, they create a pipeline for knowledge, not just data.