Tell them apart: distilling technology differences from crowd-scale comparison discussions
1. Does the paper propose a new opinion mining approach?
Yes
2. Which opinion mining techniques are used (list all of them, clearly stating their name/reference)?
difftech
3. Which opinion mining approaches in the paper are publicly available? Write down their name and links. If no approach is publicly available, leave it blank or None.
https://difftech.herokuapp.com/ demo open, not open source
4. What is the main goal of the whole study?
to compare technologies with an informative summary of different comparison aspects
5. What the researchers want to achieve by applying the technique(s) (e.g., calculate the sentiment polarity of app reviews)?
build a large database of comparable software technologies by mining tags in Stack Overflow, and locate comparative sentences about comparable technologies with NLP methods, mine prominent comparison aspects by clustering similar comparative sentences and represent each cluster with its keywords
6. Which dataset(s) the technique is applied on?
14,552 comparative sentences for 2,074 pairs of comparable technologies
7. Is/Are the dataset(s) publicly available online? If yes, please indicate their name and links.
No
8. Is the application context (dataset or application domain) different from that for which the technique was originally designed?
evaluation data can be found on https://sites.google.com/view/difftech/
9. Is the performance (precision, recall, run-time, etc.) of the technique verified? If yes, how did they verify it and what are the results?
4 evaluations are performed: Similar technology evaluation , Comparative sentence evaluation (randomly sample 300 sentences), Cluster evaluation (randomly sample 15 pairs of comparable technologies), Usefulness evaluation (check if manually extracted opinions are also identified by approach) A summary can be found at https://sites.google.com/view/difftech/
10. Does the paper replicate the results of previous work? If yes, leave a summary of the findings (confirm/partially confirms/contradicts).
No
11. What success metrics are used?
see 9
12. Write down any other comments/notes here.
-