Changes between Version 8 and Version 9 of WikiStart
- Timestamp:
- 08/27/2010 08:22:37 AM (14 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
WikiStart
v8 v9 1 = !OpenSubtitles v2 draft specification - Request for comments == 1 = !OpenSubtitles v2 draft specification = 2 3 == Introduction == 4 5 Programming in python use [http://www.python.org/dev/peps/pep-0008/ PEP-8] code practices, [http://django-python.com/python-tutorials/tips-for-readable-python-code/ another good basics] 2 6 3 7 … … 39 43 this should be done also online and via API. 40 44 45 With subtitles store its encoding. Use [http://mirmodynamics.com/post/2008/12/17/Charset-detection-with-python charset detection] 46 For language detection use TextCat, Python [http://thomas.mangin.com/data/source/ngram.py implementation], [http://code.google.com/p/langdet/ langdet], google translate [http://www.catonmat.net/blog/python-library-for-google-translate python lib] 47 41 48 == Movie section == 42 49 43 Implement more than one website for movies, now is implemented only imdb.com , which is not50 Implement more than one website for movies, now is implemented only imdb.com [http://imdbpy.sourceforge.net/ python wrapper], which is not 44 51 bad, but they don't provide any official API access to their database. That's why there is need to 45 implement sites like themoviedb.org and tvseries.org (?). 52 implement sites like [http://www.themoviedb.org/ TheMovieDB.org] [http://github.com/dbr/themoviedb python wrapper here] and [http://thetvdb.com/ TheTVDB.com] [http://pypi.python.org/pypi?%3Aaction=search&term=thetvdb&submit=search python wrapper]. 53 So support 3 sites, imdb.com as last, when 2 other fails (?) 46 54 47 55 Movie hashing - there is little need for stronger hash, which need some research, how to … … 75 83 76 84 Only registered useragents will have API access by using their key. API should be provided 77 by different standards such as XML-RPC (current), REST, JSON... 85 by different standards such as XML-RPC (current), REST, JSON...good example of API is on [http://api.themoviedb.org/ TheMovieDB]. 86 API [http://alexking.org/blog/2009/12/13/api-versioning-tip versioning] is a must. 78 87 79 88 == Caching == 89 90 Cache everything what needs to be cached in memcached :) 80 91 81 92 … … 83 94 == Software specification == 84 95 85 Lighttpd as http server 86 Postgre SQL as database server, Python as programming language, 87 Django as framework, Memcache for memory caching, (research) for subtitle caching, 96 [http://www.lighttpd.net/ Lighttpd] as http server, running FastCGI 97 [http://www.postgresql.org/ Postgre] SQL as database server, 98 [http://www.python.org/ Python] as programming language, 99 [http://www.djangoproject.com/ Django] as framework 100 * [http://github.com/dcramer/django-sphinx Django-Sphinx] 101 * Web Services 102 * [http://code.google.com/p/django-rest-interface/ REST] 103 * [http://code.djangoproject.com/wiki/JSON-RPC JSON-RPC] 104 * [http://code.djangoproject.com/wiki/XML-RPC XML-RPC] 105 * [http://code.djangoproject.com/wiki/GoogieSpell Google Spell] 106 [http://memcached.org/ Memcache] for memory caching 107 [http://sphinxsearch.com/ Sphinx-search] for fulltext search 88 108 89 Sphinx-search for fulltext search90 109 91 110 92 111 == Study == 93 112 94 X-Send-File 95 ako cachovat titule - db, filesystem,...? 113