Open Phantom is a fully open-source implementation of the approach described in the paper "[Phantom: Training Robots Without Robots Using Only Human Videos.](https://phantom-human-videos.github.io/)" This project focuses on the data collection component of the Phantom pipeline, enabling anyone with a standard RGB camera to generate training data for robot learning without requiring actual robot hardware.
* Performance optimizations needed for real-time processing
## Background
The original Phantom paper demonstrated that robots could learn tasks from human demonstrations without any robot-specific data collection. By capturing hand movements in diverse environments and converting them to robot action parameters, it's possible to train robot policies that perform effectively during zero-shot deployment.
Unlike the original implementation which relies on [MANO](https://mano.is.tue.mpg.de/index.html) (a hand model not available for commercial use), Open Phantom is built entirely with open-source components that can be used in commercial applications.
git checkout -b fix-issue-42 # Use descriptive names
```
### Set Up Pre-commit Hooks
```bash
# Install pre-commit if you don't have it
pip install pre-commit
# Set up the git hooks
pre-commit install
```
Pre-commit will automatically check your code style, formatting, and other quality standards before each commit. If any checks fail, it will prevent the commit and show what needs to be fixed.
### Make Changes
* Write clean, commented code
* Keep commits focused and with clear messages
* Test your changes thoroughly
* When you commit, pre-commit hooks will run automatically
### Submit a Pull Request
* Push your branch: `git push origin fix-issue-42`
* Create PR on GitHub pointing to `main` branch
* Include clear title (e.g., "Fix #42: Improve Hand Tracking")
* Briefly describe changes and reference related issues
This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.
## Acknowledgments
* This project is based on the research presented in "Phantom: Training Robots Without Robots Using Only Human Videos"
* We use Meta's SAM2 (Segment Anything Model 2) for hand segmentation
## Disclaimer
Open Phantom is a community implementation focused on the data collection aspects of the Phantom approach. The original paper authors are not affiliated with this specific implementation.