minor update to the repo to ensure running in all machines #14

MaxHao56 · 2025-09-27T01:14:06Z

Graph fusion layer with attention mask = None
sweeper = basics
dataset bool

Optional Changes on the side. YAML file in the configs for giga pretrain needs to have match to your directory in you local or cluster machine

liamhebert

Few minor comments on root_dir, which when resolved are good to submit :)

src/configs/dataset/giga_pretrain.yaml

src/configs/dataset/hateful_discussions.yaml

liamhebert

Still need to review EmbeddingGenerator, more soon.

logs/train/runs/2025-10-24_04-31-01/.hydra/config.yaml

src/checkpt.py

src/debug_local.sh

src/run_graphormer_hateful_discussions.sh

src/test_data.embeddings.npy

src/test_embedding.py

liamhebert · 2025-11-17T15:40:44Z

src/test_embedding.py

+    """Flatten a discussion tree into tensors for model input."""
+    dut.compute_relative_distance(tree)
+
+    flat = {


We are missing a few parameters here. Namely, is_root and distances. These should be already attributes of node (I think).

MDT-2/src/tasks/dataset.py

Lines 860 to 867 in 5f50b4a

if is_root:

parent_id = node["id"]

if node["id"] not in result["id"]:

node["images"] = node["images"][0] if node["images"] else None

result["images"].append(node["images"])

result["distances"].append(node["distances"])

liamhebert · 2025-11-17T15:41:43Z

src/test_embedding.py

+        "out_degree": out_degree,
+        "attn_bias": torch.zeros((n, n)),
+        "distance": torch.zeros((n, n, 2)),
+        "distance_index": torch.zeros((n, n), dtype=torch.int16),


This doesn't feel right... can you double check whether there should be a value here?

liamhebert · 2025-11-17T15:42:00Z

src/test_embedding.py

+        "input_ids": tokenized_text["input_ids"],
+        "attention_mask": tokenized_text["attention_mask"],
+        "token_type_ids": tokenized_text.get(
+            "token_type_ids", torch.zeros_like(tokenized_text["input_ids"])


likewise, we should be careful here

Replaced manual path setup with rootutils for better management.

Removed comments about batch size command validation.

Max Hao and others added 4 commits September 26, 2025 21:23

minor update to the repo to ensure running in all machines

071980b

requirements

eebd3f7

requirements

cc1cae4

Apply pre-commit fixes

7eabbdf

liamhebert requested changes Oct 3, 2025

View reviewed changes

src/configs/dataset/giga_pretrain.yaml Outdated Show resolved Hide resolved

src/configs/dataset/hateful_discussions.yaml Outdated Show resolved Hide resolved

Max Hao and others added 3 commits November 7, 2025 04:35

extract embeddings

4c0db21

minor fix

b835e12

minor fix2

70bb7e7

MaxHao56 requested a review from liamhebert November 11, 2025 05:25

Max Hao added 2 commits November 13, 2025 06:41

testing pass

b2f43b2

testing passes v2

96f5317

MaxHao56 marked this pull request as draft November 13, 2025 06:45

Max Hao added 3 commits November 13, 2025 07:05

intial_testing_localact

55ee8d2

intial_testing_localact2

139cae3

uncomment

8b27dbc

liamhebert requested changes Nov 17, 2025

View reviewed changes

MaxHao56 and others added 9 commits November 19, 2025 00:28

Delete logs/train/runs directory

d935531

some reverting

59bdbca

Fix command syntax in run_graphormer script

7a4f72d

Delete src/test_data.embeddings.npy

e4b8db1

Delete src/test_data.json

c25079d

Refactor path setup using rootutils

d5c335a

Replaced manual path setup with rootutils for better management.

Remove comments regarding batch size command

5de8d8d

Removed comments about batch size command validation.

almost final

09e86e4

final_fixes2

3373f2a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

minor update to the repo to ensure running in all machines #14

minor update to the repo to ensure running in all machines #14

Uh oh!

MaxHao56 commented Sep 27, 2025

Uh oh!

liamhebert left a comment

Uh oh!

Uh oh!

Uh oh!

liamhebert left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

liamhebert Nov 17, 2025

Uh oh!

liamhebert Nov 17, 2025

Uh oh!

liamhebert Nov 17, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	if is_root:
	parent_id = node["id"]

	if node["id"] not in result["id"]:
	node["images"] = node["images"][0] if node["images"] else None

	result["images"].append(node["images"])
	result["distances"].append(node["distances"])

minor update to the repo to ensure running in all machines #14

Are you sure you want to change the base?

minor update to the repo to ensure running in all machines #14

Uh oh!

Conversation

MaxHao56 commented Sep 27, 2025

Uh oh!

liamhebert left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

liamhebert left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

liamhebert Nov 17, 2025

Choose a reason for hiding this comment

Uh oh!

liamhebert Nov 17, 2025

Choose a reason for hiding this comment

Uh oh!

liamhebert Nov 17, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants