Skip to content

Commit 3a4e3b1

Browse files
authored
Merge pull request #1 from cxcscmu/haok/refactoring
Haok/refactoring
2 parents 69af5ce + 80d474d commit 3a4e3b1

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

43 files changed

+2892
-247
lines changed

.gitignore

+1
Original file line numberDiff line numberDiff line change
@@ -1 +1,2 @@
1+
wandb
12
**/__pycache__

.vscode/settings.json

-1
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,6 @@
99
},
1010
"[python]": {
1111
"editor.tabSize": 4,
12-
"editor.formatOnSave": true,
1312
"editor.defaultFormatter": "ms-python.black-formatter",
1413
},
1514
"[jsonc]": {

environment.yml

+24-20
Original file line numberDiff line numberDiff line change
@@ -6,25 +6,29 @@ channels:
66
- conda-forge
77

88
dependencies:
9-
- python==3.12
10-
- pip==24.2
9+
- python=3.12.0
10+
- pip=24.2
1111
- pytorch=2.4.0
12-
- pytorch-cuda==12.4
13-
- numpy==1.26.4
14-
- rich==13.7.1
15-
- pyarrow==16.1.0
16-
- mypy==1.10.0
17-
- aiofiles==22.1.0
18-
- aiohttp==3.9.5
19-
- transformers==4.44.2
20-
- pytest==7.4.4
21-
- pytest-asyncio==0.20.3
22-
- types-aiofiles==24.1.0.20240626
23-
- attrs==23.1.0
24-
- wandb==0.16.6
25-
- faiss-gpu==1.8.0
26-
- blobfile==3.0.0
27-
- elasticsearch==8.15.1
28-
- pillow==9.5.0
12+
- pytorch-cuda=12.4
13+
- numpy=1.26.4
14+
- rich=13.7.1
15+
- pyarrow=16.1.0
16+
- mypy=1.10.0
17+
- aiofiles=22.1.0
18+
- aiohttp=3.9.5
19+
- transformers=4.44.2
20+
- pytest=7.4.4
21+
- pytest-asyncio=0.20.3
22+
- types-aiofiles=24.1.0.20240626
23+
- attrs=23.1.0
24+
- wandb
25+
- faiss-gpu=1.8.0
26+
- blobfile=3.0.0
27+
- elasticsearch=8.15.1
28+
- pillow
29+
- seaborn
30+
- sentencepiece
2931
- pip:
30-
- treevizer==0.2.4
32+
- treevizer
33+
- beir
34+
- flash-attn

script/acquire.sh

+12-4
Original file line numberDiff line numberDiff line change
@@ -1,9 +1,17 @@
11
#!/usr/bin/bash
22

3-
# Acquire a compute node with 32 CPUs, 96GB RAM, and 1 A6000 GPU.
3+
# Validate the arguments
4+
if [ -z "$1" ]; then
5+
echo "Usage: $0 <num_gpu>"
6+
exit 1
7+
fi
8+
9+
NUM_GPU=$1
10+
11+
# Acquire a compute node with 32 CPUs, 96GB RAM, and specified number of GPUs.
412
# The node will be acquired for 7 days, and the session will be interactive.
513
# Please run this script with tmux to avoid losing the session.
614
srun \
7-
--partition=long --time=07-00:00:00 \
8-
--cpus-per-task=32 --mem=96GB --gres=gpu:A6000:1 \
9-
--pty bash
15+
--partition=long --time=07-00:00:00 \
16+
--cpus-per-task=32 --mem=96GB --gres=gpu:A6000:$NUM_GPU \
17+
--pty bash

source/dataset/__init__.py

+1
Original file line numberDiff line numberDiff line change
@@ -5,3 +5,4 @@
55
workspace.mkdir(mode=0o770, parents=True, exist_ok=True)
66

77
from source.dataset.msMarco import MsMarcoDataset
8+
from source.dataset.beir import BeirDataset

0 commit comments

Comments
 (0)