I have googled for to find a topic of a site and end with delicious.com but that is not that kind of helpfull..is there any other way to find it?? I am not talking about meta tags or description but catagory of a site...any Ideas??
I hate to give an answer to such a sloppy and carelessly written question, but nonetheless...
If you are looking to categorize a specific website, try typing this into google site:http://www.google.com/dirhp?hl=en along with the website you want to categorize and see if you get any results.
If you question is to find websites in a certain category, then try google's categorized directory. It's different than a normal web search.
and... If this answer helped you, please click the checkbox next to it.
Related
still a newbie so I have a bunch of questions which might seem silly. However, here I go:
as a project to learn python, I decided I would start with something simple. The idea I had was to go to craigslist and find the prices of motorcycles, go through the first 10 pages of each city and give a mean, median and price trends. the result of that work can be found here (http://craigslistcompanion.co.nf/) as well as the codes.
now in order to make things better i decided that i should let the user input what they want (motorcycle/ car/ bikes) and from where (sfbay/ austin etc) and get the same results as shown in that page. as i started looking into it i was told that learning django would be a good place to start and i have been trying to do that for the last couple of days. however, what i have learn so far is mostly setting up databases and handling them.
I was wondering if there is a tutorial somewhere which will show me how to take input from the user and then manipulate it. i want to do this completely using python/ django.
for example lets say i want the user to input their first and last name and then try to play with that string and output something. how would i go about doing that?
thanks
First some clarification. Do you want to develop a web interface for your users? If so, then Django is a great start. The Django Book is a very good introduction to Django and teaches you with examples. The task of getting user information, validating it, manipulating it and saving it in a database is the most basic of task and it is explained well in the book.
The next thing is web scraping craigslist. From how you structured your question, it seams that you have solved that part of the problem. right?
I am trying to disallow certain parts of a site instead of the whole thing.
I am relatively new to this so if someone walk me through it I will appreciate it.
I know that you can Disallow: /page1.cfm from crawlers but what if I want to disallow just a part of that page such as a link or a contact form that exists on that page? Is this functionality even possible?
Based on some forums I was reading recently, the "nofollow" function is not really working anymore since the crawlers are becoming smarter. (i don't know how credible that forum was, so if anyone has a better source please share)
Any suggestions?
Don't use nofollow, you'll loose linkjuice on your page.
Robots.txt are just a hint for crawler, half of the time with disalow rules if they already have find the page they still visit it and index it.
Use .htaccess rules to ban or block the access to this pages.
Or crypt your link with a complex .js (base64_encode() + str_rot13() encoding should be enough to lost the crawler)
You can use the attribute "nofollow" in the meta tag to hide the information on the page. Google wrote that they do not pass on the links labeled "nofollow." More information about this and examples you can find here:Robots.txt tutorial and Google supportHope this helps
I'm using the comments box on my site to get more users socially involved and at the same time I'm reading the Graph API to print those comments on the same site to make it crawlable for search engines. Problem is now that for a real user the comments show up twice (through my graph api print AND through the actual fb:comments plugin box).
Any ideas how I could solve this ?
Would prefer to show only the comment-field from FB and use my own echo to show the comments as this way enables it to REALLY moderate the comments (not just set it to limited visibility). Thanks for any suggestion!
I found some sort of solution for my problem. It's not perfect, but the closest one can get to what I wanted:
Instead of directly returning the query from the Graph API, I attached a script to the comment.create event. This script writes the new comment into my database, where I can moderate again. This way I have to double-moderate, but therefore I don't have any unwanted comments on my site.
I also found out that with the old deprecated comment-box you can set the comments that get natively shown by facebook to 0. That way only the already moderated comments get shown by my own script coupled with my database. It works the way I want it, even for friends of the unwanted commenter.
Hope that helps someone else too!
I am looking for a blogging and comments system that can smoothly integrate with my Django sites. I've found there is a lot on the Net and got lost a bit, and I don't have much experience on this. Hope you guys can give me some suggestions.
Here are the things that I would like to have:
Tag Clouds
Articles Archive (by months/by years)
Articles Rating (e.g. with Stars or customize icons)
Comments to the particular Topic/Articles
Sub-Comments of a particular comments (i.e. following up comments)
Blogs/Articles Searching
Able to relate other articles that is relevant (i.e. follow up Articles)
Pagination of the comments if get too long
OpenIDs supports (e.g. facebook, hotmail, blogger, twitter...etc)
Support login before user can comments
Able to retrieve Blogs' Header and customized the display order
Able to subscribe this article to RSS
Able to Email this to friends (this may not belongs to the comments system)
If I missed some common functions, please let me know. The comments system I am looking for should do most jobs that those popular comments system should do on the web, e.g. WordPress.
Thank you so much everyone. Have a nice day.
I myself really like django-threadedcomments. It supports threaded commenting like what you would see in Disqus.
i heard django-comment-utils is quite good. - may you test it :)
I'm working on a project that would be greatly optimized if I am able to restrict a Google, Yahoo or Bing/Live search to just the < meta content="xyz" name="description" /> tag. I read all the help pages and api docs and can't seem to find a way to do it, and I thought I'd give it a shot here on Stack Overflow.
Thanks :)
Unfortunately none of the big 3 support this sort of limitation. I tried a similar search a while back limited to tags, but had no luck.
If you are trying to find web pages that match a description exactly, here is a solution.
Find a major website that reports "website stats" for third parties.
Example www.sitebrain.com
Then use the site: operator in google to search that site for a list of pages that report on sites including your target descriptions.
Example site:statbrain.com + Get product information, support, and news from Microsoft.
The resulting Google results will be a list of pages all matching the description.