code cleanup tool that:
- looks for comments (AST parser?)
- classifies the commented text as either code or natural language
- simple kNN classifier
- training data: some small test corpus vs. surrounding code fragments
- if a contiguous comment block contains only code, delete it
- need to avoid flagging markdown in comments (e.g. code examples in doc comments)
- how to prompt for user approval?
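
rough sketch of the pipeline above, assuming Python sources and stdlib only. It uses `tokenize` rather than an AST parser, since Python's `ast` drops comments. Everything named here is hypothetical: the tiny hand-labeled `CORPUS`, the two features (share of code-like symbols, share of purely alphabetic words), and `k=3` are placeholders, not tuned choices. It only reports removable line ranges; markdown handling and the approval prompt are left open.

```python
import io
import math
import tokenize

# Hypothetical hand-labeled corpus (True = code, False = natural language).
# A real one would be sampled from the code base being cleaned.
CORPUS = [
    ("x = compute(y)", True),
    ("for i in range(10):", True),
    ("return foo.bar(baz)[0]", True),
    ("if x == 1: print(x)", True),
    ("This function returns the user name", False),
    ("TODO fix this later", False),
    ("Handle the edge case where the list is empty", False),
    ("See the docs for details", False),
]

# Assumed set of characters that are more common in code than in prose.
CODE_CHARS = set("=()[]{};:<>+*/_.")


def features(text):
    """Map text to (share of code-like symbols, share of purely alphabetic words)."""
    words = text.split()
    symbol_ratio = sum(c in CODE_CHARS for c in text) / max(len(text), 1)
    alpha_ratio = sum(w.isalpha() for w in words) / max(len(words), 1)
    return (symbol_ratio, alpha_ratio)


def is_code(text, corpus=CORPUS, k=3):
    """Majority vote among the k corpus entries nearest to text."""
    query = features(text)
    ranked = sorted(corpus, key=lambda item: math.dist(query, features(item[0])))
    votes = sum(label for _, label in ranked[:k])
    return votes > k / 2


def comment_blocks(source):
    """Return contiguous runs of full-line comments as (first_line, last_line, texts)."""
    blocks = []
    for tok in tokenize.generate_tokens(io.StringIO(source).readline):
        if tok.type != tokenize.COMMENT:
            continue
        row, col = tok.start
        if tok.line[:col].strip():
            continue  # trailing comment after code: not part of a block
        text = tok.string.lstrip("#").strip()
        if blocks and blocks[-1][1] == row - 1:
            first, _, texts = blocks[-1]
            blocks[-1] = (first, row, texts + [text])
        else:
            blocks.append((row, row, [text]))
    return blocks


def removable_blocks(source):
    """Line ranges of comment blocks in which every non-empty line looks like code."""
    result = []
    for first, last, texts in comment_blocks(source):
        lines = [t for t in texts if t]  # ignore bare "#" spacer lines
        if lines and all(is_code(t) for t in lines):
            result.append((first, last))
    return result


SAMPLE = '''\
# Compute the total of all values
total = 0
# total = sum(values)
# print(total)
for v in values:
    total += v  # running sum
'''

print(removable_blocks(SAMPLE))  # prints [(3, 4)]
```

the block on line 1 is kept because its nearest neighbours are all prose; the trailing comment on line 6 is skipped because it shares a line with code. The returned ranges could feed whatever approval step is chosen, e.g. showing each block and asking before deleting.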