why technorati don't want to help me? Mine blog is not updated from 1 month. I have written posts asking help that have been ignore by you. When I've added my answers in posts of others users, all had their problem resolved but not I. Please you want to help also me?
Support
Topic: [closed] PING BUT NOT UPDATED!!!!
-
KilRoY66,
I have some good news and I have some bad news. The good news is, we want to help you :)The other good news is that I think I know what the problem is. When our crawler fetches your page, the gzip encoding is detected as corrupted. This doesn't appear to be a widespread systemic problem on our end, but I can definitely reproduce it with your blog URL. When I fired up a hacked crawler against your blog, it worked, check out http://technorati.com/blogs/blog.tntvillage.org
So, one thing I would suggest, since you're using Apache 2.2 you're probably using mod_deflate in the server. Try configuring your server with this:
BrowserMatch ^Technoratibot/0\.07 no-gzip
(see the docs at http://httpd.apache.org/docs/2.2/mod/mod_deflate.html)
Then ping us after your next update and report back what happens. I'm looking into making the crawler more resilient to corrupted gzip streams but in the interim, this may solve your problem.
thanks,
-Ian -
thanks indeed for your help. I will make the modifications that you have suggested to me and I will inform you for sure, but I've noticed that now the last update was yesterday. Probably for your work on this problem. I will inform you as soon as I will have followed the procedure that you have suggested to me. Again,thanks
-
Hi spidaman
You referred me to this thread from:
http://support.technorati.com/topic/390?replies=19#post-1637Thanks for your reply.
You mention that your crawler detects a corrupted gzip encoding. But currently we are not using mod_defalte on our Apache webserver, so it should not even try to gzip anything. I've tried to telnet to the server on port 80 requesting a webpage in gzip, and as far as I can see it just returns plain text/html.
We recently moved to a bigger server and it's very likely that we where running mod_deflate on our old server. Could it be that your crawler remembers and expect our previous encoding and fails with a "corrupt encoding" error when it expects gzip but gets text/html?
I can see that technorati is now up to date with our feed. Is this something that you've done manually and do we still have a problem?
Once again, thanks for your time :)
-
wa7son,
The updates you and KilRoY66 see on your blogs are because I ran an end to end diagnostic with gzip disabled in the HTTP client, so I'm very confident that that's culprit.By default our crawler sends 'Accept-encoding: gzip' in the request headers. Your server (and blog.tntvillage.org) responds to that with 'Content-Encoding: gzip' (which is correct) and a byte stream that looks like gzip bytes but our library reads it as corrupted. I assumed you were using mod_deflate because it's an Apache 2.2.4 server and mod_deflate (which ships with 2.2.4, but isn't on by default AFAIK) does adaptive response compression.
thanks,
-Ian -
Spidaman,
after 24h with the :
BrowserMatch ^Technoratibot/0\.07 no-gzip
and several manul ping, no update yet.I found in the wordpress option:
WordPress should compress articles (gzip) if browsers ask for them
and I've deactivate also this to see if can help. -
We WIN!!!!!!
Spidaman it has resumed to work immediately after that I have deactivated the option:
[]WordPress should compress articles (gzip) if browsers ask for them
This option is in the wordpress admin panel under
Options - Reading
So, thanks for your help and I suggest for all wordpress users with the update problems to check if this could help -
Hi Spidaman+KilRoY66
My guess is that KilRoY66 is on to something here. I've always had the "WordPress should compress articles (gzip)..." option ON. But I've also had the mod_deflate Apache module loaded.
Now that I'm on a server that does not have the module loaded, I suspect that WordPress tries to do something the the server is not capable of. Bottom line is that this is a WordPress bug - not a Technorati bug.
To spidaman: Sorry that I did not catch this before. In the debug process, I tried to telnet to another domain on the same server that was not running WordPress. That's why I did not get the error you where talking about.
-
No, this was an issue of your blog being errantly flagged. Looks good now, http://technorati.com/blog/myocum.blogspot.com
-
-
Hi spidaman
I don't know if you need any help tracing the issue, but I've been trying to debug the problem.
I'm using the web debugging proxy 'Charles' on OS X to capture the request and response HTTP traffic, and it seems to be able to both read and decompress the gzip encoded response without any issues.
I can't figure out how your crawler breaks on the HTTP response, when I'm able to view the page in Firefox (which should receive the same gzip encoded HTTP response). I've even tried to verify with 'Charles' that the response is in fact gzip'ed AND readable
I can only come up with the following explanations:
1) Either you are requesting a different URL (that IS in fact broken). I'm just requesting http://justaddwater.dk/
2) Or Firefox must be very good at decompressing corrupted gzip data
3) Or your crawler must have some kind of gzip bug that only occurs in certain circumstances (which, I cant figure out). (is there different versions of gzip maybe?)
Btw: I've tracked down how WordPress can send gzip encoded data back, even though the mod_deflate module is not loaded in Apache. WordPress relies on the PHP zlib extension to gzip the response - NOT Apaches mod_deflate (even though WordPress states the opposite on its website. See 'gzip enabled' on: http://codex.wordpress.org/WordPress_Features).
-
Thanks for the info, that really helps!
The crawler is using python's native gzip library (which I believe is also linked against zlib) and I think it is the culprit, other HTTP client implementations I've tested don't have this problem. I plan on implementing a workaround soon, I suspect this is inhibiting a small but not insignificant number of sites from getting indexed.
-
My site has not been updated recently. I tried a manual ping, no help. I tried to send a support request, but the request doesn't seem to go through. I thought I'd try a message here. My site is:
Can someone help me?
-
Hi spidaman
Do you need a test-blog? I need to disable the gzip feature in WordPress at the moment to get Technorati to index. I don't know if you have any other blogs with this issue that you can test against. Let me know if you need any more info or help. You can e-mail me at w-at-tson-dot-dk
-
same here, not updated for 5 days even though I did manual ping.
-
skthew, I see your latest post here:
http://technorati.com/blogs/www.skthew.com
Your ping was processed, but right about the same time you posted this forum entry.
-
Hi, our site has not been updated on technorati for 31 days, yet we post a new vlog every week. We tried pinging but to no avail.
It was working fine before, and we haven't altered anything on our end. Can any one help?
-
Hmmm. I don't believe I have ever had the "gzip" option selected (Wordpress) and am having similar problems (http://www.credoadvisors.com/blog/). The "mod_deflate" suggestion is a bit beyond my tech capabilities (I'm on a shared linux server and haven't a clue how to implement the suggestion). Any thoughts?
-
hi, I am using blogger, http://simpleartsplanet.blogspot.com.
Not very tecnically and IT trained, so I can't quite understand the gzip and I am not using wordpress. I ping and the blog is still not update, showing 260 over days not updated. I update my blog regularly though.
Help, anyone?
-
My sites are not updated by ping either. I use b2evolution and it has a built in ping to technorati ticked whenever I publish a new article. That has no effect. So I came to technorati and used the ping here - still no effect. It says that my sites were last updated 95, 95 and 19 days ago. I have tried this many times over weeks. That is about when they were created.
My sites are visible on this page (just the top three):
http://technorati.com/people/technorati/RayTomes
or as
http://ray.tomes.biz/b2/index.php/a
http://ray.tomes.biz/b2/index.php/b
http://ray.tomes.biz/b2/index.php/c -
Hi,
My blogs are not able to ping and the link counts are wrong.
http://cyberpartygal.blogspot.com/
http://mycrapsheet.blogspot.com/I wrote to Technorati Support since February, 2007 but no reply or solution.
Please help me. Thank you!
Topic Closed
This topic has been closed to new replies.
















