IR Thoughts Legacy Posts
IR Thoughts Legacy Posts from 4/28/2007 to 6/27/2006. Please read the following instructions before requesting a copy.
This is a list of legacy posts published at the old home of IR Thoughts. To get the full text of a particular post contact Dr. Garcia at admin@miislita.com. The new blog is available at IR Thoughts (http://irthoughts.wordpress.com). Once a while the full text or portions of a legacy post will be published at the new home. These will be archived in the Legacy Posts category.
4/28/2007 10:19:01 AM Fractals in Information Retrieval: Splog Detection
4/27/2007 9:08:50 AM C-Indices as Co-Retrieval Estimates
4/26/2007 8:19:05 PM Thanks, Poly
4/25/2007 9:57:12 AM W3C AIRWeb 2007 News: Spammers Served with the Papers
4/24/2007 8:04:59 AM On Co-Occurrence and Search Engines
4/23/2007 12:50:59 PM What SEOs are Realizing about LSI Myths
4/22/2007 9:49:13 AM Futile!: User-Machine Relevance Comparisons & SEO
Strength Tools
4/21/2007 10:35:28 AM Building a Client-Side Parser
4/20/2007 9:15:05 PM Presenting at Turin Lab of Polytechnical University
4/13/2007 9:45:25 PM Google Monopoly
4/12/2007 9:28:13 AM Intektel Conference Starts Today!
4/11/2007 4:11:35 PM The Impact of Search Engines on Telecommunications
4/10/2007 2:41:47 PM Being Referenced in an Honour Thesis
4/6/2007 8:19:01 AM ICANN 29th Public Meeting in Puerto Rico
4/5/2007 8:51:59 AM SEOs/Spammers, Beware: W3C AIRWeb07 Accepted Papers
4/4/2007 7:57:05 AM GMAIL: Shootting Your Own Foot
4/3/2007 10:00:13 AM LSA: A Goldmine for Educators, Curriculum Developers, and
School Administrators
4/2/2007 10:24:14 AM Snake Preview of IRW: Intellectual Property Searches
4/1/2007 12:33:01 PM Weekend without Power
3/30/2007 12:32:44 PM The Machine-Users Relevance Perception Gap
3/29/2007 8:34:28 AM Call for Participation in INEX 2007
3/28/2007 12:55:56 PM JavaScript Click Frauds
3/27/2007 3:32:51 PM The Highly Structured Collections Challenge
3/26/2007 8:11:03 AM Invited to the International Technology Conference &
Expo of IntekTel
3/25/2007 4:35:46 PM Testing, Testing, Testing...
3/24/2007 11:02:43 AM Testing my Patent Search Tool
3/23/2007 10:42:59 AM Spammers, SEOs Targeting .EDU Sites with Copy Persuasion
Tactics
3/22/2007 9:15:27 AM W3C2007 Banff Papers
3/21/2007 1:49:37 PM On Typos and Patents
3/20/2007 7:25:03 AM Our Tutorials are Required Readings at University of Maryland!
3/19/2007 10:00:29 AM Mathematics of Language Workshop
3/18/2007 3:19:10 PM Snake Preview of Patentes
3/17/2007 7:48:42 AM The Hate-Love Story of IRs, Spammers, and SEOs
3/16/2007 8:03:02 AM Lecturing at Interamerican University
3/11/2007 8:18:57 AM Back and Some Sad News
3/7/2007 8:52:39 AM Catch me at the Congress
3/6/2007 3:26:40 PM Getting Ready for OjoBuscador Congress
3/5/2007 2:34:27 PM Contextual Data in SERPs with LSI
3/4/2007 10:08:47 AM Udating or Recomputing in LSI? Meet Folding-in Alternatives
3/3/2007 8:56:02 AM LSI: Not a Document Indexing Algorithm
3/2/2007 9:02:22 AM Japan Invitation to the Great Information Navigation
Project
3/1/2007 12:11:47 AM Some Random Notes
2/28/2007 9:31:06 AM Snake Preview of IR Watch 2007-3
2/27/2007 10:24:48 AM Preview of a Patent Search Tool
2/26/2007 9:08:50 AM Closeness, Proximity, Similarity, and Distance
2/25/2007 8:49:20 AM On SEWF, Co-Occurrence, and Free-Riders
2/24/2007 9:01:06 AM Referenced in an ASU Lecture
2/23/2007 8:19:38 AM Being Referenced by a ComSIS Paper that Compares Search
Engines
2/22/2007 10:05:49 AM Free SEO Tip: Using Keywords in Documents
2/21/2007 10:14:56 AM A Term Frequency-Importance Conjecture for SEOs
2/20/2007 10:45:41 AM Entropy Weights and SEO Paranoic Ideas
2/19/2007 9:42:03 AM LSI Term-Doc Matrix: Noise Reduction and Weights
Redistribution
2/18/2007 11:08:46 AM Site Changes
2/17/2007 10:16:55 AM Software Review Requests: A Tale of Two Responses
2/16/2007 9:41:58 AM This Month SIAM Conferences
2/15/2007 9:46:10 AM Top Text Mining Software for Research, Academia, and Marketing
2/14/2007 9:08:50 AM The Problem of Sparsification in LSI
2/13/2007 9:43:01 AM W3C AIRWEB 07 Extended Deadline Announcement
2/12/2007 9:34:07 AM Why SEOs have LSI Backward
2/11/2007 1:35:08 PM Dirac Notation in IR
2/10/2007 10:06:16 AM How to Read Research Papers
2/9/2007 11:33:45 AM Big O Notation Tutorials
2/8/2007 9:02:16 AM Making the Notre Dame Resource List of Markov Chains
2/7/2007 9:21:20 AM Ontologies as Expectations of Co-Occurrence and Homeland
Security
2/6/2007 10:22:12 AM Book on PageRank and Beyond
2/5/2007 2:37:20 PM 1st Euro Workshop on LSI in Techno-Enhanced Learning
2/3/2007 8:36:07 AM Reactions to IRWatch - Newsletter
2/2/2007 9:20:49 AM New LSI work for Updating the SVD Matrix in LSI
2/1/2007 12:03:42 AM IRW-2007-2: Understanding EF-Ratios
1/31/2007 10:41:00 AM Using Local Weights Only
1/30/2007 9:00:39 AM Understanding Local Weights
1/29/2007 10:23:07 AM The Pulse Around
1/28/2007 9:31:59 AM Global Weights with Entropy
1/27/2007 9:27:33 AM Custom Searches in IDP
1/26/2007 8:03:27 AM Call for Participation: 2007 Computer Security Awareness Video
Contest
1/25/2007 9:24:50 AM LSI Ruby Classifier
1/24/2007 12:34:50 PM Snake Preview of IRW - The Newsletter
1/23/2007 1:57:19 PM IPAM: Mathematics of Knowledge and Search Engines Program
1/22/2007 9:23:22 AM Upcoming IPAM Events
1/21/2007 9:13:52 PM The Law of Search Engines Conference
1/20/2007 8:28:24 AM Upcoming Search Engine Conference with Mi Islita
1/19/2007 9:22:14 AM SEO: Science, Snakeoil Marketing or What?
1/18/2007 9:30:56 AM Academic Research Opportunities with Microsoft Live Labs
1/17/2007 10:07:57 AM Dr. Peter Turney and LRA
1/16/2007 10:20:45 AM ResearchChannel: LRA, Microsoft Behind the Code, and more.
1/15/2007 4:54:03 PM Snake Preview of IDP
1/14/2007 3:02:43 PM Update on Pet Project
1/13/2007 12:44:44 PM Infonortics Conference
1/12/2007 9:38:08 AM Research on Web Spam at 2007 W3C AIRWeb Conference
1/11/2007 10:53:07 AM Upcoming SIGIR Conferences
1/10/2007 9:39:52 AM Complete list of SIGIR Awards
1/9/2007 9:31:49 AM IR Videos from SIGIR and RCVL
1/8/2007 5:15:17 PM Back to Business!
1/7/2007 2:31:46 PM Students: Whatever You do, Do not Use Google
1/6/2007 10:35:36 AM Upcoming SIGIR Conferences
1/5/2007 9:53:45 AM Quantum Haystacks
1/4/2007 9:45:11 AM New Directions in Multilingual Information Access
1/3/2007 10:03:06 AM Web2.0: Social Bookmarking Services
1/2/2007 2:26:13 PM Understanding How Users Update their Queries
1/1/2007 9:49:42 AM Starting the Year with IRW
12/31/2006 10:13:57 AM Sneak Preview of IRW
12/30/2006 9:29:17 AM IR Relevancy vs. Suggestion Task Relevance
12/29/2006 1:15:18 PM Top 10 Search Terms from HitWise
12/28/2006 9:37:46 AM How do users reformulate their queries?
12/27/2006 9:47:52 AM Web Search Rankings based on User Behaviors
12/26/2006 4:18:55 PM Query Chains, Anyone?
12/25/2006 9:04:03 AM Does Wikipedia Resemble the Web?
12/24/2006 4:33:33 PM Out for the Holiday.
12/23/2006 10:05:32 AM Google on Link Baiting
12/22/2006 5:29:32 PM Brute Force vs. Query Expansion through Relevance Feedback
12/21/2006 1:55:15 PM A Tag Cloud of Lies
12/20/2006 9:20:12 AM News from Yahoo
12/19/2006 12:21:32 PM Google Top Searches and Keyword-Brand Associations
12/18/2006 9:31:37 AM MSN, LYCOS, and AOL Top Searches and Keyword-Brand
Associations
12/17/2006 4:39:43 PM Some CSS Tips
12/16/2006 9:59:20 AM Some Great News from Google
12/15/2006 10:03:43 AM TMG: MATLAB Tool for Creating Term-Doc Matrices from
Collections
12/14/2006 9:00:04 AM Official Call for Papers for WWW2007 AIRWeb Workshop
12/13/2006 8:51:33 AM UCLA Computer Breach Case and a Paradigm: Universities Never
Learn
12/12/2006 9:48:04 AM On Relevance Feedback, Query Expansion, and Query Chains
12/11/2006 7:45:48 AM Why an Interface Project?
12/10/2006 10:17:29 AM 2007 TREC Conference and Call for Participation
12/9/2006 12:20:34 PM Distributable Multi Search Interfaces
12/8/2006 11:44:20 AM LSI, Term Vectors, and Keyword-Brand Associations
12/7/2006 11:14:00 AM Understanding the Co-Retrieval Space
12/6/2006 11:33:33 AM Why Keyword-Brand Associations are So Important
12/5/2006 9:11:52 AM Keyword-Brand Associations and the Human Brain
12/4/2006 1:47:45 PM IRW-December Issue is Out!
12/3/2006 9:06:55 AM Natural Language Parsing (NLP)
12/2/2006 8:56:01 AM Old Research on Word Associations and Co-Occurrence
12/1/2006 8:45:38 AM User-to-System Normalizations
11/30/2006 11:27:05 AM IRW Dec Issue: Co-Retrieval, Co-Occurrence, Branding and Brain
Activities
11/29/2006 8:27:38 AM Readings in LSA for Cognitive Science and Education
11/28/2006 8:53:45 AM LSI and the Tilde Symbol
11/27/2006 11:08:05 AM A Typical December
11/26/2006 10:04:21 AM Why We Have Removed Alexa
11/25/2006 11:10:29 AM LSI: A Performance-Dimensionality Phenomenon Hypothesis
11/24/2006 3:36:17 PM Analizing MSN Search Funnel
11/23/2006 7:50:04 PM Thanksgiving
11/22/2006 11:36:10 AM TV/Radio Ads and Syntagmatic and Paradigmatic Words
11/21/2006 9:56:33 AM Simplest Way to Compute Cosine Similarities - Part II
11/20/2006 9:20:32 AM Semantic Networks
11/19/2006 8:34:49 AM Thesauri Generation
11/18/2006 8:57:57 AM Clustering Algorithm References
11/17/2006 8:37:43 AM How Many Clusters Are Out There?
11/16/2006 8:56:09 AM ACM Fifteenth Conference on Information and Knowledge
Management
11/15/2006 10:08:24 AM What happened with them?
11/14/2006 9:55:05 AM When LSI is Contraindicated
11/13/2006 8:36:03 AM Cosine Similarities and Web Analytics
11/12/2006 8:42:22 AM Jaccard Coefficients
11/11/2006 4:32:50 PM Simple Matching Coefficients and Consumer Questionnaires
11/10/2006 7:54:53 AM Universities to Offer Courses and Research in Web Sciences
11/9/2006 9:18:52 AM Dice Coefficient
11/8/2006 9:51:59 AM PCA Is Not LSI
11/7/2006 9:01:14 AM Understanding Multidimensional Scaling by Visualizing
Correlations
11/6/2006 9:11:34 AM Program of 2007 OJOBucador Congress
11/5/2006 11:51:43 AM IR Measures and Web Metrics
11/4/2006 8:32:12 AM City-Block and Euclidean Distances
11/3/2006 9:28:58 AM IPAM Upcoming Programs and Workshops
11/2/2006 10:56:26 AM November Issue of IR Watch
11/1/2006 9:33:58 AM Similarity Measures and Search Engine Marketing
10/31/2006 9:04:03 AM Answer to Challenge Question
10/30/2006 11:21:03 AM A Challenge Question
10/29/2006 10:57:04 AM Understanding Disimilarities (Distances)
10/28/2006 11:05:16 AM Clustering Objects with Different Scales
10/27/2006 9:27:38 AM Block-Level Link Analysis Test
10/26/2006 9:10:43 AM When cosine similarities equal dot product similarities
10/25/2006 2:44:38 PM Defining Co-Occurrence
10/24/2006 9:33:09 AM From Term Vector to LSI to PLSI to LDA
10/23/2006 8:03:06 AM Invited to W3C AIRWeb 2007
10/22/2006 1:56:06 PM Important BlueBit Upgrade for SVD Calculations
10/21/2006 1:15:16 PM BlueBit Calculator Gives now Transpose of V directly.
10/20/2006 10:38:55 AM There is No Such Thing as LSI-Friendly Documents
10/19/2006 8:52:48 AM SVD and LSI Tutorial 5 and Fast Track is Available Now
10/18/2006 8:54:37 AM Snake Preview of SVD and LSI Tutorial 5
10/17/2006 9:47:42 AM Hierarchical Domain Structure and Similarity Measures
10/16/2006 8:54:18 AM Karen Sparck-Jones and Stephen Robertson Letters
10/15/2006 10:37:10 AM The IDF Page: Origins of the IDF Concept
10/14/2006 10:05:35 AM SVD and LSI Tutorial 5: Keyword Research with LSI
10/13/2006 9:26:03 AM Two LSI Blogonomies from SEOs
10/12/2006 9:10:29 AM Free SEO Advice: On Experiments from Top N Ranked Results
10/11/2006 10:14:38 AM Two Great Resources
10/10/2006 8:52:48 AM Learning a Distance Metric from Relative Comparisons
10/9/2006 2:29:22 PM A New Study on Distance Metrics as Similarity Measurements
10/8/2006 10:21:35 AM How many Term Weight Formulas are Out There?
10/7/2006 9:35:39 AM On Similarity and Relatedness
10/6/2006 3:02:51 PM The IR Happy Hour
10/5/2006 10:05:25 AM On Keyword Research, Co-Occurrence and Contextuality
10/4/2006 9:07:57 AM Upcoming Articles on C-Indices
10/3/2006 9:49:23 AM Comments on Mike Grehan ClickZ Column on LSI
10/2/2006 8:30:49 AM Which Term Weight to Use?
10/1/2006 10:37:26 AM Sending Out IRW Issue 2
9/30/2006 2:10:37 PM Snake Preview of IR Watch
9/29/2006 9:16:53 AM Microsoft SEO Tools
9/28/2006 9:08:46 AM AJAX and W3C Accessibility Guidelines
9/27/2006 11:58:00 AM Free SEO Advice
9/26/2006 10:52:52 AM More on SEO Forums and LSI
9/25/2006 8:51:38 AM On Crawling and Indexing
9/24/2006 4:56:22 PM What is an Inverted Index
9/23/2006 10:09:21 AM Random Note on SEO Forums
9/22/2006 8:34:15 AM LSI Fast Track Tutorial
9/21/2006 11:08:16 AM LSI Fast Track Tutorial is Next
9/20/2006 12:56:10 PM SVD and LSI Tutorial 4 is Here
9/19/2006 1:49:46 PM Tomorrow is the Day
9/18/2006 9:29:43 AM Beware of "LSI Based" Tools
9/17/2006 9:36:01 AM SVD and Chatroom Surveillance
9/16/2006 11:20:20 AM Snake Preview of SVD and LSI Tutorial 4
9/15/2006 8:18:36 AM Representing Documents and Queries in the Same Space
9/14/2006 8:31:56 AM NAACL HLT 2007 Preliminary Call for Papers
9/13/2006 9:20:58 AM NonNegative Matrix Factorization (NMF) in Chemistry and
Medicine
9/12/2006 9:42:11 AM SVD Fast Track Tutorial
9/11/2006 10:05:06 AM SVD and LSI Tutorial 3 is Available Now
9/10/2006 9:52:04 AM Snake Preview of SVD and LSI Tutorial 3: Computing the Full
SVD
9/9/2006 9:32:18 AM A Model for Computing Context Vectors from Term Co-Occurrence
9/8/2006 11:12:13 AM PhD Thesis: Document ranking using web evidence
9/7/2006 8:18:51 AM Holographic Reduced Representations (HRRs) in IR
9/6/2006 11:34:31 AM PhD Thesis: Understanding LSI via the Term-Term Truncated
Matrix
9/5/2006 9:11:14 AM Latent Semantic Indexing News
9/4/2006 8:47:06 AM Get IR Watch and Play with SVD Code
9/3/2006 10:36:16 AM Launching of IR Watch - The Newsletter
9/2/2006 10:19:24 AM Orthogonal Matrices
9/1/2006 8:59:10 AM Links as Votes of Citation Importance?
8/31/2006 9:20:24 AM High Order Co-Occurrence
8/30/2006 9:45:59 AM Addressing some questions on Term Vector Theory
8/29/2006 9:49:17 AM Upcoming Tutorials
8/28/2006 2:49:16 PM Go Ahead. Make My LSI Day
8/27/2006 11:07:41 AM Simplest Way to Compute Cosine Similarity Values
8/26/2006 8:45:34 AM An LSA Tutorial and Notes on SVD in HPLC Chemistry
8/25/2006 11:00:02 AM Latest SEO Incoherences (LSI)
8/24/2006 2:06:41 PM Tutorials on PCA and LDA
8/23/2006 9:13:25 AM On Logs and Security
8/22/2006 11:14:38 AM Master Thesis: Information Retrieval with Genetic Programming
8/21/2006 10:03:13 AM Fractal Concept Decomposition vs. LSI and Vector Space
8/20/2006 9:08:07 AM Why SEM and Spammers Should Learn LSI How-to Calculations
8/19/2006 9:24:29 AM On Fools and Phonies: How can I know if it is real?
8/18/2006 9:27:49 AM Erdos Numbers from 25 Years of SIGIR and the Diamond
Dozen
8/17/2006 2:04:37 PM Latent Semantic Indexing Resources
8/16/2006 9:08:52 AM Document Collections as Fractal Clusters
8/15/2006 8:30:25 AM University of Chicago Workshop: What to Do with a Million
Books
8/14/2006 10:56:36 AM Snake Preview of IR Watch - The Newsletter
8/13/2006 12:03:05 PM Top 13 Hard-to-Find Applied Mathematics Papers
8/12/2006 11:09:38 AM Updates: LSI Tutorial and NetLogo Software
8/11/2006 3:34:38 PM SVD and LSI Tutorial is Here!
8/10/2006 9:46:43 AM Snake Preview of SVD and LSI Tutorial
8/9/2006 9:53:21 AM NMF and Text Mining Approaches for Email Surveillance
8/8/2006 9:49:58 AM Telcordia Distributed LSI and a Possible Security Risk
8/7/2006 9:00:30 AM Professor Keith van Rijsbergen Wins The Gerard Salton Award
8/6/2006 9:37:13 AM Search Engine Marketers and their LSI Myths
8/5/2006 9:45:47 AM Master Thesis: A Language-Based Approach to Categorical
Analysis
8/4/2006 9:43:58 AM SIGIR 2006 Workshops
8/3/2006 10:48:20 AM SIGIR 20006 Demos and Doctoral Consortium Sessions
8/2/2006 9:41:58 AM This Week is SIGIR 2006
8/1/2006 10:09:21 AM Stanford Group: Workshop on Algorithms for Modern Massive
Datasets
8/1/2006 10:04:13 AM Random Notes
7/31/2006 11:17:56 AM SIAM Call for Tutorials on Data Mining
7/30/2006 11:05:45 AM Prof. Keith Rijsbergen Views On Relevance and Aboutness
7/29/2006 1:29:29 PM Eigenvectors and Reggaeton Music
7/28/2006 2:00:56 PM On Fractals, Multifields, PCA and IRs
7/27/2006 8:59:39 AM Prof. Gene Golub and Two New PageRank Abstracts
7/26/2006 11:36:11 AM Computing accurately ordered PageRank scores
7/26/2006 10:30:01 AM Two SIAM Conferences this Week and a Rumor
7/25/2006 9:00:56 AM IPAM Course: Mathematics of Knowledge and Search Engines
7/24/2006 10:19:19 AM The Myths and Math of SEO - The Interview
7/23/2006 10:23:01 AM Readers Questions on Eigenvalues and Software Tutorials
7/22/2006 12:04:33 PM L-System Fractals and Web Crawlers?
7/22/2006 8:21:45 AM NetLogo Workshop and Argonne National Lab Conference
7/21/2006 11:14:47 AM Software Code for Implementing Term Vectors and LSI
7/20/2006 1:08:16 PM On SVD and PCA: Some Applications
7/19/2006 10:31:18 AM Upcoming Tutorials: LSI (SVD) and Covariance Analysis (PCA)
7/18/2006 11:49:57 AM How about an IR-SEM Event?
7/17/2006 11:39:27 AM Matrix Tutorial 3: Eigenvalues and Eigenvectors
7/16/2006 11:03:12 AM For Data Mining Managers: W3C WWW2006 Data Mining Session
7/15/2006 1:23:54 PM Snake Preview of Matrix Tutorial 3
7/14/2006 10:27:07 AM Matrix Inversion
7/13/2006 11:24:36 AM Release of NetLogo 3D Preview: 3D Worlds for you to see
7/13/2006 10:13:54 AM For SEM Managers: WWW2006 WebMining Session
7/12/2006 4:43:10 PM 2007 Search Engine Meeting from Infonortics
7/12/2006 10:13:58 AM For Web2: WWW2006 W3C Semantic Tagging Session
7/11/2006 9:51:42 AM For SEOs: How they were described at the WWW2006 W3C Web Spam
Session
7/10/2006 1:06:41 PM Demystifying LSA, LSI, SVD, PCA, and IS ACRONYMS
7/9/2006 2:29:12 PM Designing web sites using Term Vector, Text Clustering and
Link Analysis
7/9/2006 1:42:37 PM Two Great Research Papers by Dr. Brian Davison
7/8/2006 12:47:01 PM SEO Blogonomies: The Search Engine Markov Chain
7/7/2006 11:26:55 AM Invited to 2007 OjoBuscador Congress - March 15-16, Madrid,
Spain
7/6/2006 10:15:41 AM Matrix Tutorial 2 - Matrix Operations
7/5/2006 10:26:26 AM NetLogo Upgrade
7/4/2006 11:29:33 AM Happy 4th of July
7/3/2006 4:40:32 PM Matrix Tutorial 1 - Stochastic Matrices
7/3/2006 9:04:30 AM Some definitions
7/2/2006 10:56:53 AM 2006 TREC Tracks
7/1/2006 9:33:10 AM Entrez and NSDL, Two Great Search Resources
6/30/2006 3:15:10 PM What’s Next
6/29/2006 3:16:26 PM Understanding EF-Ratios
6/28/2006 11:58:43 AM Learning The Vector Space Model
6/27/2006 4:14:25 PM Welcome to IR Thoughts