1. Posting some methodology in the README might be good, such as any filtering you did or didn't do.
2. You do have some obvious duplicates on there, e.g. `cit` and `citadel`, `jane` and `jane-street`. It's probably not worth the effort to manually clean that up, but I figured I'd mention.
1. Posting some methodology in the README might be good, such as any filtering you did or didn't do.
2. You do have some obvious duplicates on there, e.g. `cit` and `citadel`, `jane` and `jane-street`. It's probably not worth the effort to manually clean that up, but I figured I'd mention.