feat: bigquery_fetcher entrypoint by matt-codecov · Pull Request #67 · getsentry/usage-accountant

matt-codecov · 2026-06-11T23:51:52Z

as title

add optional timestamp accumulator for times when you're querying a snapshot. you can put in the snapshot time
move common types/functions from datadog_fetcher.py to new fetcher_utils.py (and same for tests)
write bigquery_fetcher.py entrypoint which works basically the same as the datadog one but with a BQ client
packaging/dependency/Dockerfile boilerplate

i'm wiring some inventory data into BigQuery for objectstore COGS. reusing usage-accountant to push BQ rollups through Kafka saves us a good deal of trouble.

Ref FS-210

sentry · 2026-06-11T23:54:24Z

+                app_feature=app_feature,
+                amount=int(amount),
+                usage_type=unit,
+                timestamp=None if timestamp is None else int(timestamp),


Bug: The code calls int(timestamp) on a value from BigQuery. This will raise a TypeError because the BigQuery client returns datetime.datetime objects for timestamp columns, which cannot be cast to an integer directly.
_{Severity: CRITICAL}

Suggested Fix

Before converting the timestamp to an integer, check if it is a datetime.datetime object. If it is, convert it to a Unix epoch timestamp first (e.g., by using timestamp.timestamp()) before casting to int. This will correctly handle the type returned by the BigQuery client.

Prompt for AI Agent

Review the code at the location below. A potential bug has been identified by an AI agent. Verify if this is a real issue. If it is, propose a fix; if not, explain why it's not valid. Location: py/usageaccountant/bigquery_fetcher.py#L75 Potential issue: The code at `py/usageaccountant/bigquery_fetcher.py:75` attempts to convert a `timestamp` value from a BigQuery result to an integer using `int(timestamp)`. However, the BigQuery Python client library automatically converts `TIMESTAMP` columns into `datetime.datetime` objects. Calling `int()` directly on a `datetime.datetime` object will raise a `TypeError`, causing the data fetching process to fail whenever a query includes a standard timestamp column. The existing tests do not detect this because they use mock dictionaries with integer timestamps, not actual BigQuery `Row` objects.

_{Did we get this right? 👍 / 👎 to inform future reviews.}

lcian · 2026-06-12T11:56:10Z


+[project.optional-dependencies]
+# Only needed for the `bigquery_fetcher` entrypoint
+bigquery = ['google-cloud-bigquery>=3.0.0']


Do we need to update the uv.lock as well? Although not sure if that's used anywhere in practice, maybe only for development.

lcian · 2026-06-12T12:09:29Z

+        bucket_time = now if timestamp is None else timestamp
        key = UsageKey(
-            floor(now / self.__granularity_sec) * self.__granularity_sec,
+            floor(bucket_time / self.__granularity_sec)
+            * self.__granularity_sec,


Maybe better?

Suggested change

bucket_time = now if timestamp is None else timestamp

key = UsageKey(

floor(now / self.__granularity_sec) * self.__granularity_sec,

floor(bucket_time / self.__granularity_sec)

* self.__granularity_sec,

bucket_time = timestamp if timestamp is not None else floor(now / self.__granularity_sec) * self.__granularity_sec

key = UsageKey(

bucket_time,

lcian · 2026-06-12T12:16:32Z

+    query_list = parse_and_assert_query_file(query_file)
+    record_list: List[UsageAccumulatorRecord] = []
+    for query_dict in query_list:
+        query = query_dict["query"]


I hope the query is pretty concise, otherwise I don't really love the idea of putting a large sql query in a JSON.

the Bigtable query for us will look something like:

SELECT SPLIT(rowkey, '/')[OFFSET(0)] AS app_feature, -- TODO mapping SUM( -- `m` field size: object metadata IFNULL(BYTE_LENGTH(COALESCE(fg.m.cell.value, fm.m.cell.value)), 0) -- `p` field size: object payload -- check `m` for precomputed size, fallback to scanning `p` + IFNULL( COALESCE( LAX_INT64( JSON_QUERY( TO_JSON(COALESCE(fg.m.cell.value, fm.m.cell.value)), "$.size")), BYTE_LENGTH(COALESCE(fg.p.cell.value, fm.p.cell.value))), 0) -- `t` field size: tombstone metadata + IFNULL(BYTE_LENGTH(COALESCE(fg.t.cell.value, fm.t.cell.value)), 0) -- `r` field size: tombstone + IFNULL(BYTE_LENGTH(COALESCE(fg.r.cell.value, fm.r.cell.value)), 0)) AS amount FROM `eng-dev-sbx--files-1.objectstore.bigtable_objectstore` GROUP BY usecase ORDER BY total_bytes DESC;

GCS one should be shorter bc size is precomputed

lcian

Left some suggestions. Please address the CI checks as well.

linear-code · 2026-06-13T00:12:00Z

FS-210

feat: bigquery_fetcher entrypoint

a160ba5

matt-codecov requested a review from a team June 11, 2026 23:51

sentry Bot reviewed Jun 11, 2026

View reviewed changes

lcian reviewed Jun 12, 2026

View reviewed changes

Comment thread Dockerfile

lcian reviewed Jun 12, 2026

View reviewed changes

lcian approved these changes Jun 12, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: bigquery_fetcher entrypoint#67

feat: bigquery_fetcher entrypoint#67
matt-codecov wants to merge 1 commit into
mainfrom
matth/add-bigquery-fetcher

matt-codecov commented Jun 11, 2026 •

edited

Loading

Uh oh!

sentry Bot Jun 11, 2026

Uh oh!

Uh oh!

lcian Jun 12, 2026

Uh oh!

lcian Jun 12, 2026 •

edited

Loading

Uh oh!

lcian Jun 12, 2026

Uh oh!

matt-codecov Jun 12, 2026

Uh oh!

lcian left a comment

Uh oh!

linear-code Bot commented Jun 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

matt-codecov commented Jun 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sentry Bot Jun 11, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

lcian Jun 12, 2026

Choose a reason for hiding this comment

Uh oh!

lcian Jun 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

lcian Jun 12, 2026

Choose a reason for hiding this comment

Uh oh!

matt-codecov Jun 12, 2026

Choose a reason for hiding this comment

Uh oh!

lcian left a comment

Choose a reason for hiding this comment

Uh oh!

linear-code Bot commented Jun 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

matt-codecov commented Jun 11, 2026 •

edited

Loading

lcian Jun 12, 2026 •

edited

Loading