Pull Request for the Algorithm Comparison Tool #4

Open

mihaivavram wants to merge 8 commits into glciampaglia:master from mihaivavram:master

mihaivavram commented Jan 24, 2017

Knowledge Linker Algorithm Comparison tool. It takes the output files from linkpred and computes comparison metrics and confusion statistics for any two given algorithms. Results can be printed to the screen or written to files, and the tool can be run from the command line.


Knowledge Linker Algorithm Comparison tool, taking the output files from linkpred, and running comparison metrics and confusion statistics from any two given algorithms, can print to screen or to files and can be run from the command line

9a4f2da

glciampaglia reviewed

scripts/algcomptool/tool/AlgCompTool.py Outdated

		@@ -0,0 +1,317 @@
		#Author: Mihai Avram - [email protected]

		import sys, getopt

Owner

glciampaglia Jan 28, 2017

Please use argparse instead of sys.argv, remove unused getopt import

glciampaglia reviewed

scripts/algcomptool/tool/AlgCompTool.py Outdated

+              def computeAndPrint(writeToFile):
+                  #Importing algorithms to be compared
+                  alg1 = pd.read_csv("../input/presidentcouplesNODES.csv")

Owner

glciampaglia Jan 28, 2017

Input file should not be hardcoded into source code. Should be passed as command line arguments instead.
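
A minimal sketch of what this comment asks for: the two input paths arrive on the command line through `argparse` instead of being written into `computeAndPrint`. The argument names `alg1_file` and `alg2_file` and the sample file names are assumptions made for illustration, not names taken from the PR.

```python
import argparse


def build_parser():
    # Hypothetical argument names; the PR may choose different ones.
    parser = argparse.ArgumentParser(
        description="Compare the linkpred output of two algorithms.")
    parser.add_argument("alg1_file",
                        help="path to the first algorithm's output file")
    parser.add_argument("alg2_file",
                        help="path to the second algorithm's output file")
    return parser


# An explicit list is parsed here for illustration; a script would call
# build_parser().parse_args() with no arguments to read sys.argv.
args = build_parser().parse_args(["first.csv", "second.csv"])
print(args.alg1_file, args.alg2_file)
```

The parsed paths can then be handed to `pd.read_csv` in place of the hardcoded strings.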

glciampaglia reviewed

scripts/algcomptool/tool/AlgCompTool.py Outdated

+              def computeAndPrint(writeToFile):
+                  #Importing algorithms to be compared
+                  alg1 = pd.read_csv("../input/presidentcouplesNODES.csv")
+                  alg2 = pd.read_csv("../input/presidentcouplesRSIM.csv")

Owner

glciampaglia Jan 28, 2017

Same as above

glciampaglia reviewed

scripts/algcomptool/tool/AlgCompTool.py Outdated



		#Algorithm comparison
		if noNulls:

Owner

glciampaglia Jan 28, 2017

Instead of enclosing all computations in a if block, exit if there are nulls (e.g. sys.exit(1) to return error status code), and then continue the rest of the computation without the if block:

if there_is_an_error:
    print("There was an error", file=sys.stderr)
    sys.exit(1) 

<NORMAL OPERATIONS>
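
A runnable version of the same guard-clause pattern, as a sketch only. The helper name `require_no_nulls`, the sample rows, and the choice to treat `None` and the empty string as nulls are all assumptions; the tool itself works on pandas data frames.

```python
import sys


def require_no_nulls(rows):
    """Exit with status 1 if any cell is missing; otherwise return normally."""
    # Treating None and "" as null is an assumption of this sketch.
    if any(cell is None or cell == "" for row in rows for cell in row):
        print("Input contains null values", file=sys.stderr)
        sys.exit(1)


rows = [["1", "dbr:A"], ["2", "dbr:B"]]  # made-up sample rows
require_no_nulls(rows)

# Normal operations continue here, unindented, with no enclosing if block.
print("compared", len(rows), "rows")
```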

glciampaglia reviewed

scripts/algcomptool/tool/AlgCompTool.py Outdated

+                      #WRITING THE RESULTS TO FILES
+                      else:
+                          fileComp = open("../output/algorithmscomparison.txt",'w')

Owner

glciampaglia Jan 28, 2017

Let user specify output path, do not hardcode file paths.
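
One way to do this, sketched under assumptions: an optional `-o/--output` argument (a hypothetical name) carries the path, and the tool falls back to standard output when it is omitted. The helper `open_output` is likewise invented for illustration.

```python
import argparse
import sys


def build_parser():
    parser = argparse.ArgumentParser()
    # Hypothetical option name; when omitted, results go to the screen.
    parser.add_argument("-o", "--output", default=None,
                        help="file to write results to (default: standard output)")
    return parser


def open_output(path):
    """Return a writable stream: the named file, or sys.stdout if path is None."""
    return sys.stdout if path is None else open(path, "w")


args = build_parser().parse_args([])  # no -o given, so output goes to stdout
out = open_output(args.output)
out.write("results would be written here\n")
if out is not sys.stdout:
    out.close()
```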

glciampaglia reviewed

scripts/algcomptool/tool/AlgCompTool.py Outdated

+                      else:
+                          fileComp = open("../output/algorithmscomparison.txt",'w')
+                          #Printing Results

Owner

glciampaglia Jan 28, 2017

The code below is duplicated. Use a function instead of copy-pasting the same code block. Code reuse decreases the chances of bugs and makes the code more readable.
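
A sketch of the kind of refactoring meant here, assuming the duplicated blocks differ only in where the text goes. The function name `write_results` and the sample values are made up; the point is that one function accepts any file-like object, so printing to screen and writing to a file share a single code path.

```python
import io
import sys


def write_results(results, out):
    """Write one 'name: value' line per result to any file-like object."""
    for name, value in results:
        out.write(f"{name}: {value}\n")


results = [("metric_a", 0.5), ("metric_b", 0.25)]  # made-up values

write_results(results, sys.stdout)   # print to screen
buffer = io.StringIO()               # stands in for an opened output file
write_results(results, buffer)       # same code path, different destination
```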

glciampaglia reviewed

scripts/algcomptool/tool/AlgCompTool.py Outdated

		import string


		def main(argv):

Owner

glciampaglia Jan 28, 2017

Use argparse instead of writing your own error reporting.
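
To illustrate the point, here is a small sketch of the error reporting `argparse` already provides. The argument name `alg1_file` is an assumption. When a required argument is missing, the parser prints a usage message and the error to standard error, then exits with status 2, so no hand-written checks are needed.

```python
import argparse

parser = argparse.ArgumentParser(prog="AlgCompTool.py")
parser.add_argument("alg1_file")  # hypothetical argument name

try:
    parser.parse_args([])  # no arguments supplied
except SystemExit as exc:
    # argparse has already printed the usage message and the error to stderr.
    exit_code = exc.code
    print("argparse exited with status", exit_code)
```

For checks that `argparse` cannot express by itself, `parser.error("message")` reports the problem in the same format and exits the same way.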

glciampaglia self-assigned this

Owner

glciampaglia commented Jan 28, 2017 (edited)

Can you please:

- Move the contents of the README into a comment at the top of the script and delete the file.
- Move the script file directly under scripts and remove algcomptool/tool; the extra folders are unnecessary.
- Delete the input and output folders and all their contents. In general DO NOT commit data to the repository unless explicitly needed. Other users generally do not need to download your data, so it is a burden on their connection and storage.

Mihai Avram and others added 4 commits

March 15, 2017 16:34


Fixed and added some functionality: removed getopt import, used argparse instead of sys.argv, passed input/output files through command line, implemented sys.exit(1) error calls, created functions to reuse code, fixed file writing bug, and changed the README file

358ce22


          Update README.md

36f2d7b

Updated README.md to include markdown text.


Removed output directory and tool directory, kept input files as example of what input files are expected and they are not so large so I believe they are okay for reusability purposes to keep them there

eb8c46f


          Merge branch 'master' of https://github.com/mihaivavram/knowledge_linker

669002a

glciampaglia reviewed

scripts/algcomptool/AlgCompTool.py Outdated

+              def main():
+              	parser = argparse.ArgumentParser()
+              	#Required parameters -a1 and -a2 which denote the paths to the two input algorithm files to be compared (print output to the console)
+              	parser.add_argument("-a1", "--alg1file", type=str, help="the relative or absolute location and name of the input file of the first algorithm", required=True)

Owner

glciampaglia May 16, 2017

You can just pass add_argument("alg1file", type=str...) and that will become a positional argument. Positional arguments are required. Also, I recommend using more descriptive names for the user, e.g. input-file-1; if you still want the destination to be called alg1file you may pass dest="alg1file" so that you don't need to change the rest of your code.
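
A sketch of the positional form. One caveat about the standard library: `argparse` does not accept `dest=` on a positional argument and raises `ValueError` if it is given. The usual way to get a friendlier display name while keeping `alg1file` as the attribute is `metavar`. The second argument, `alg2file`, and the sample file names are assumed here by analogy.

```python
import argparse

parser = argparse.ArgumentParser()
# Positional arguments are required by default. The first string is the
# attribute name; metavar only changes how it is shown in usage and help.
parser.add_argument("alg1file", metavar="input-file-1", type=str,
                    help="output file of the first algorithm")
parser.add_argument("alg2file", metavar="input-file-2", type=str,
                    help="output file of the second algorithm")

args = parser.parse_args(["nodes.csv", "rsim.csv"])
print(args.alg1file, args.alg2file)

# Passing dest= to a positional argument is rejected by argparse.
try:
    argparse.ArgumentParser().add_argument("input-file-1", dest="alg1file")
    dest_rejected = False
except ValueError as err:
    dest_rejected = True
    print("ValueError:", err)
```

With this setup the rest of the script can keep reading `args.alg1file`, while the usage line shows `input-file-1`.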

scripts/algcomptool/AlgCompTool.py

+              					fileComp.write("Alg1 < Alg2 for relation: " + " ID: " + str(idSubjectMapping[subject]) + " - Subject: " + str(subject.replace("dbr:","")) + "\n")
+              				elif alg1Ratio < alg2Ratio:
+              					fileComp.write("Alg1 > Alg2 for relation: " + " ID: " + str(idSubjectMapping[subject]) + " - Subject: " + str(subject.replace("dbr:","")) + "\n")
+              		fileComp.close

Owner

glciampaglia May 16, 2017

Missing function call?

Author

mihaivavram Jun 27, 2017

Changing all "missing function calls" you mentioned except this one, because this requires a different output format.

scripts/algcomptool/AlgCompTool.py Outdated

+              				else:
+              					fileConfAlg1.write(str(alg1.iloc[sample, idColLocation]) + "," + str(alg1.iloc[sample, subjectColLocation].replace("dbr:","")) + "," + str(alg1.iloc[sample, objectIDColLocation]) + "," + str(alg1.iloc[sample, objectColLocation].replace("dbr:","")) + ",TN" + "\n")
+              		fileConfAlg1.close

Owner

glciampaglia May 16, 2017

Missing function call?

Author

mihaivavram Jun 27, 2017

Changed the fileConfAlg1.write and fileConfAlg2.write procedures to call a single function that can write both. Functions could be introduced in many more places; however, where a block is repeated only once or twice, I don't think further abstraction is needed. If a block is called three or more times, then yes, writing a function is a good idea. Let me know if you agree with this.

scripts/algcomptool/AlgCompTool.py Outdated

+              				else:
+              					fileConfAlg2.write(str(alg2.iloc[sample, idColLocation]) + "," + str(alg2.iloc[sample, subjectColLocation].replace("dbr:","")) + "," + str(alg2.iloc[sample, objectIDColLocation]) + "," + str(alg2.iloc[sample, objectColLocation].replace("dbr:","")) + ",TN" + "\n")
+              		fileConfAlg2.close

Owner

glciampaglia May 16, 2017

Missing function call?

Author

mihaivavram Jun 27, 2017

Are all your "Missing function call?" comments about code-structure abstraction, i.e. turning each standalone block into a module-level function and calling that function from main? Or did you mean something else?
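
The "Missing function call?" remarks are attached to lines such as `fileComp.close`, `fileConfAlg1.close` and `fileConfAlg2.close`, so they most likely refer to the missing parentheses and not to code structure. Without `()`, Python only looks up the method and never runs it, so the file is not closed at that point. The sketch below shows the difference, using an in-memory `io.StringIO` as a stand-in for a real output file.

```python
import io

stream = io.StringIO()   # stands in for an open output file

stream.close             # only looks up the method; nothing is called
still_open = not stream.closed

stream.close()           # the parentheses perform the call
now_closed = stream.closed

# A with block closes the stream automatically, so the call cannot be forgotten.
with io.StringIO() as managed:
    managed.write("row\n")
closed_by_with = managed.closed

print(still_open, now_closed, closed_by_with)
```

The same `with open(path, "w") as f:` form works for real files and removes the need for an explicit `close()` call.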

mihaivavram added 3 commits

June 27, 2017 18:48


AlgorithmComparisonTool: Changing input format directive and creating function abstraction for confusion output files

7abda19


          Update README.md

a5f3b3b

AlgorithmComparisonTool: Updating README.md and fixing some formatting problems within this file.


          Update README.md

1e5a61e

AlgorithmComparisonTool: More README.md formatting
