Permalink
Cannot retrieve contributors at this time
Name already in use
A tag already exists with the provided branch name. Many Git commands accept both tag and branch names, so creating this branch may cause unexpected behavior. Are you sure you want to create this branch?
genetics-jargon-counter/README.md
Go to fileThis commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
20 lines (15 sloc)
427 Bytes
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Count all the concepts introduced in Genetics 2410. There are over 500! | |
``` | |
cat *.txt | wc | |
525 1266 11355 | |
``` | |
The concepts are burried in PDFs which is typically difficult for computer programs to read. | |
Therefore we first convert the PDFs into XML, and then parse the XML using Python. | |
To do both, at the command-line simply run: | |
```bash | |
make | |
``` | |
Now we can see the list of concepts using: | |
```bash | |
cat *.txt | |
``` |