Add a lightweight dataset loader and at least one metric/evaluation script so this starter can move toward real reproduction.
Expected work:
- folder or json-based dataset loader
- one metric script suited to the task
- example command in README
- clear note that numbers are local experiments until paper protocol is matched
Add a lightweight dataset loader and at least one metric/evaluation script so this starter can move toward real reproduction.
Expected work: