分类归档: Web

Blog 归来记

这个Blog offline 了足足有一个多月.

站点原来是放在我所在的上一家公司的主机上.因为一些变动,接替我的DBA告诉我空间不能继续用了.当时我正忙得一塌糊涂,根本没时间顾及,好在是把数据备份了出来.

有一段时间都不想继续写了,也是工作比较忙的缘故吧. 幸好 Tigerfish 答应给我一点空间.后来麻烦了 cometrue 不少次,不支持Perl,配置CGI-BIN,配置 Apache,安装DB_File 模块,解决 UTF-8 的显示问题, 总算是把 Blog 重新搭建了起来.其实ITpub本身也提供Blog功能的.可是我自己非常不喜欢这种 Blog 大集中的方式.加上其他种种原因,就放弃了在 ITpub 的那个Blog.

继续阅读

此文作者:, 位于 Web 分类 标签: on .
转载须以超链接形式标明文章原始出处和作者信息及版权声明.

Tom 的经验谈

Tom 是怎么成为一个 Oracle 数据库大师的?在这里有所提及:

it took me 16 years of “all about oracle” to be here. started as a programmer/analyst on the mainframe writing PL/I code against SQL/DS and DB2 (PL/I is still by far the coolest language I’ve ever used). That started my SQL career. I bought Oracle version 5.1.5c from Dr. Dobbs Journal for $99.00 in the late 80’s and started building systems with it. Then I was “forced” to use
Sybase, Informix, Ingress, Gupta SQLBase and other “rdbms’s” as well.

阅读/学习基本理论和掌握产品文档是必要的一步:

I also learned all about databases then — read the theory (like Jim Grays Transaction Processing Book, not really light reading but taught me lots of stuff). Then read the practical (the reference manuals).
So I spent six years as a developer/dba. Even though I was a coder — i found doing things in the database to make tons more sense then writing tons of code to do the same. I actually quit my first job after six years soley for the reason that they wouldn’t let me exploit the software they paid so much for (they were so into this “generic database”, the “database is a commidity”
concept that they had me coding our own two phase commit protocol, our own stored procedure language, our own compilers for the same, our own sql parser — it was frightening. Funny thing is — I meet with customers all of the time
still doing just that!)
the first six years is what really did it. on the job training, reading — tech books (egads — the manuals!) lots of magazines (i was a magazine junkie, dr dobbs, the c programmers journal, you name it).

继续阅读

此文作者:, 位于 Web 分类 标签: on .
转载须以超链接形式标明文章原始出处和作者信息及版权声明.

Some Hints for Nutch

好久没关注Nutch了,看邮件列表,学到了几招关于 Nutch 的小技巧.

  • 如何索引动态 URL 站点?

    调整 regex-urlfilter.txt 或是 crawl-urlfilter.txt 文件. 参见行”# skip URLs containing certain characters as probable queries,后面的内容.

  • 编译 Nutch 需要用到的 Ant 版本至少要 1.6 以上.

继续阅读