Namkoong sees this work as a step toward helping a broader range of researchers and institutions build effective AI systems. By replacing trial-and-error with principled guidance, the framework makes it easier to optimize training strategies in a variety of settings.
“My goal is to help the research community develop tools that are broadly accessible—not only to major technology companies, but to smaller institutions and nonprofits as well,” he says.
“Better foundations does not mean a newer generation of transformers. It means a better way to curate data sets. And in some sense, this is a woefully understudied topic in the research community.”
Infrastructure That Accelerates Research
The project was supported in part by Empire AI, which provided access to high-performance computing resources across New York State. That access allowed Namkoong’s team to dramatically accelerate their work.
“With Empire AI, what would have taken a year or two using our own machines took just 2–3 weeks.”
While infrastructure played a key role in scaling the experiments, the intellectual work—the problem formulation, modeling, and strategy—was developed at Columbia, where Namkoong’s lab is advancing a broader research agenda focused on trustworthy, efficient, and adaptable AI.