A hands-on guide and lab documentation for importing structured data into a Firestore database using Google Cloud Shell and Node.js.
Cloud Platform
- Google Cloud Platform (GCP)
- Firestore (NoSQL database)
- Cloud Shell
- Cloud Logging
- gcloud CLI

Backend & Data Handling
- Node.js (server-side scripting)
- npm (package management)

Key npm Packages
- @google-cloud/firestore
- @google-cloud/logging
- csv-parse
- faker

Data Format
- CSV (Comma-Separated Values)

Command Line Tools
- Git
- gcloud CLI
Google Cloud Platform (GCP) provides tools for managing databases, deploying AI/ML pipelines, and building full-scale applications. Firestore is one of GCP's real-time NoSQL document databases, optimised for scalability and performance.
In this lab, we will:
- Set up a Firestore instance.
- Write scripts to import CSV data into Firestore.
- Generate synthetic customer data.
- Enable logging using Google Cloud Logging.
- Google Cloud account (or Qwiklabs temporary project).
- Basic knowledge of Node.js and npm.
- Familiarity with Cloud Shell.
- Open Google Cloud Console.
- Click Activate Cloud Shell (top-right).
- Pre-installed tools include: gcloud, git, and npm.
gcloud auth list
gcloud config list project
Output:
[core]
project = <project_ID>
- Go to Navigation Menu > Firestore.
- Click Select Native Mode.
- Choose a region and click Create Database.
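If you prefer the command line, a Firestore database can also be created with gcloud. A sketch, assuming a recent gcloud release (flag names have changed across versions, and nam5 is just an example multi-region, so substitute the location you chose above):

gcloud firestore databases create --location=nam5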
Native Mode vs. Datastore Mode:
Both modes offer high performance with strong consistency, but they are tailored to different usage scenarios. Native Mode excels when many users need concurrent access to shared data: it provides real-time updates and a direct connection between your database and web or mobile clients. Datastore Mode, on the other hand, prioritises high throughput for read- and write-intensive workloads.
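To make the real-time claim concrete, here is a minimal sketch of a Native Mode snapshot listener using @google-cloud/firestore (the customers collection name anticipates the one used later in this lab):

const { Firestore } = require("@google-cloud/firestore");

const db = new Firestore();

// Fires on the initial snapshot and again on every subsequent write.
const unsubscribe = db.collection("customers").onSnapshot((snapshot) => {
  snapshot.docChanges().forEach((change) => {
    console.log(`${change.type}: ${change.doc.id}`);
  });
});

// Call unsubscribe() once you no longer need live updates.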
git clone https://github.com/rosera/pet-theory
cd pet-theory/lab01
Inspect the package.json file for dependencies and scripts.
Install required libraries:
npm install @google-cloud/firestore @google-cloud/logging csv-parse
Your package.json should now include:
"dependencies": {
"@google-cloud/firestore": "^6.4.1",
"@google-cloud/logging": "^10.3.1",
"csv-parse": "^4.4.5"
}
Update importTestData.js:
🔹 Add Required Modules
// csv-parse v4 exports a callback-style parser; promisify it for async/await use
const { promisify } = require("util");
const parse = promisify(require("csv-parse"));
const { readFile } = require("fs").promises;
const { Firestore } = require("@google-cloud/firestore");
const { Logging } = require("@google-cloud/logging");
🔹 Initialise Firestore and Logging
const db = new Firestore();
const logName = "pet-theory-logs-importTestData";
const logging = new Logging();
const log = logging.log(logName);
// "global" is the generic monitored-resource type for entries not tied to a specific GCP service
const resource = { type: "global" };
🔹 Firestore Batch Import Function
function writeToFirestore(records) {
  const batchCommits = [];
  let batch = db.batch();
  records.forEach((record, i) => {
    const docRef = db.collection("customers").doc(record.email);
    batch.set(docRef, record);
    // A Firestore batched write is limited to 500 operations, so commit
    // and start a fresh batch every 500 records.
    if ((i + 1) % 500 === 0) {
      console.log(`Writing record ${i + 1}`);
      batchCommits.push(batch.commit());
      batch = db.batch();
    }
  });
  // Commit whatever remains in the final partial batch.
  batchCommits.push(batch.commit());
  return Promise.all(batchCommits);
}
🔹 Main Import Function with Logging
async function importCsv(csvFileName) {
  const fileContents = await readFile(csvFileName, "utf8");
  // columns: true maps each CSV row to an object keyed by the header row
  const records = await parse(fileContents, { columns: true });
  try {
    await writeToFirestore(records);
  } catch (e) {
    console.error(e);
    process.exit(1);
  }
  console.log(`Wrote ${records.length} records`);
  const success_message = `Success: importTestData - Wrote ${records.length} records`;
  const entry = log.entry({ resource }, { message: success_message });
  log.write([entry]);
}
importCsv(process.argv[2]).catch(console.error);
Install Faker (this lab pins version 5.5.3, whose faker.datatype API matches the code below):
npm install faker@5.5.3
Edit createTestData.js:
🔹 Add Logging Setup
// fs and faker are already required at the top of the cloned createTestData.js
const { Logging } = require("@google-cloud/logging");
const logName = "pet-theory-logs-createTestData";
const logging = new Logging();
const log = logging.log(logName);
const resource = { type: "global" };
🔹 Create Customer Data
async function createTestData(recordCount) {
  const fileName = `customers_${recordCount}.csv`;
  const f = fs.createWriteStream(fileName);
  // Write the CSV header row first
  f.write("id,name,email,phone\n");
  for (let i = 0; i < recordCount; i++) {
    const id = faker.datatype.number();
    const firstName = faker.name.firstName();
    const lastName = faker.name.lastName();
    const name = `${firstName} ${lastName}`;
    // Derive the email from the generated name so each record is internally consistent
    const email = faker.internet.email(firstName, lastName).toLowerCase();
    const phone = faker.phone.phoneNumber();
    f.write(`${id},${name},${email},${phone}\n`);
  }
  console.log(`Created file ${fileName} containing ${recordCount} records.`);
  const success_message = `Success: createTestData - Created file ${fileName} containing ${recordCount} records.`;
  const entry = log.entry({ resource }, { name: fileName, recordCount, message: success_message });
  log.write([entry]);
}
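The cloned createTestData.js already wires this function to the command line; for completeness, a minimal sketch of that wiring (verify the exact code against your copy of the file):

// Read the record count from the command line, e.g. `node createTestData.js 1000`.
const recordCount = parseInt(process.argv[2], 10);
if (!recordCount || recordCount < 1) {
  console.error("Usage: node createTestData.js <recordCount>");
  process.exit(1);
}

createTestData(recordCount);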
Run the following command to configure your Project ID in Cloud Shell, replacing PROJECT_ID with your Qwiklabs Project ID:
gcloud config set project PROJECT_ID
Now, set the project ID as an environment variable:
PROJECT_ID=$(gcloud config get-value project)
Run the following command in Cloud Shell to create the file customers_1000.csv, which will contain 1000 records of test data:
node createTestData.js 1000
Run the import:
node importTestData.js customers_1000.csv
If you see a csv-parse error, reinstall the version 4 line the script expects (v5 changed the module's default export, which breaks the promisify call above):
npm install csv-parse@4
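To sanity-check the import, you can count the documents server-side. A minimal sketch, assuming the aggregate count() query available in @google-cloud/firestore v6:

const { Firestore } = require("@google-cloud/firestore");

async function countCustomers() {
  const db = new Firestore();
  // Aggregation runs in Firestore itself; no documents are downloaded.
  const snapshot = await db.collection("customers").count().get();
  console.log(`customers contains ${snapshot.data().count} documents`);
}

countCustomers().catch(console.error);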
Create and import a larger dataset (optional):
node createTestData.js 20000
node importTestData.js customers_20000.csv
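To confirm the log entries reached Cloud Logging, read them back from Cloud Shell (the log name matches the one set in importTestData.js):

gcloud logging read "logName=projects/$PROJECT_ID/logs/pet-theory-logs-importTestData" --limit 5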
You've successfully:
- Created test data using Faker
- Imported data into Firestore using a batch write method
- Logged operations using Google Cloud Logging
This hands-on lab provides a complete mini-pipeline for structured data ingestion into Firestore using GCP-native tools and Node.js.