- Add GPU deployment support with NVIDIA runtime
- Update Dockerfile.allinone with GPU environment variables
- Add comprehensive GPU_DEPLOYMENT.md guide
- Make port 11434 (Ollama) optional for security
- Update DEPLOYMENT.md with CPU and GPU deployment options
- Simplify default docker run commands
- Update healthcheck to only check web application
- Add memory requirements documentation
- Create MEMORY_REQUIREMENTS.md with model comparison
- Add build-8b.sh script for lower memory usage
- Document OOM troubleshooting steps
- Improve Docker build process
- Add BUILD_TROUBLESHOOTING.md for common issues
- Add DISTRIBUTION.md for image distribution methods
- Update .gitignore to exclude large binary files
- Improve docker-entrypoint.sh with better diagnostics
- Update .dockerignore to include ollama-linux-amd64.tgz
- Add backup file exclusions to .gitignore