hero_compute --start fails with "Symbolic link loop (os error 40)" #70
Labels
No labels
prio_critical
prio_low
type_bug
type_contact
type_issue
type_lead
type_question
type_story
type_task
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
lhumina_code/hero_compute#70
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Problem
hero_compute --startfails immediately with:This happens on a fresh install (no symlinks in ~/hero) with hero_compute v0.1.3 on Ubuntu 24.04.
Steps to Reproduce
curl -sfL .../scripts/install.sh | bashhero_compute --start --mode worker --master-ip <IP>Symbolic link loop (os error 40)Also fails in single-node mode:
hero_compute --startEnvironment
find ~/hero -type l)Investigation
readlink -f ~/hero/bin/hero_computeresolves correctly (not a filesystem symlink)straceshows the ELOOP originates from inside the Rust binary, not from filesystem symlinksstd::os::unix::fs::symlink()orstd::fs::canonicalize()hitting a path that loopsSuspected Root Cause
The
--startcommand registers services with hero_proc. It may be creating a symlink (e.g. for the binary or a socket) that points back to itself, or callingcanonicalize()on a path with too many indirections.fixed