This is a list of news sites and the methods to access them without giving
away personal data or suffering from other things.

The format is what i use with my config
(in the "files/Filters/Configuration Files/" section of Y!G proxlist).
However, with the filters in the first section of this file it should be
possible to do the same with every config.

Most of the faked cookies were gathered by using the account info of the
cool new service at http://bugmenot.com/ .

############ Example filters for other configs

Nothing new here, just a collection for this specific purpose.
All except the last one should work with older Proxomitron versions too.

### Method 1: faking GoogleNews referrer

Create a new list "GoogleNews.txt"

[Blocklists]
List.GoogleNews= "..\Lists\GoogleNews.txt"

[HTTP headers]
In = FALSE
Out = TRUE
Key = "Referer: Fake GoogleNews (Out)"
URL = "$LST(GoogleNews)"
Replace = "http://news.google.com/"

### Method 2: faking Googlebot user-agent

Create a new list "Googlebot.txt"

[Blocklists]
List.Googlebot= "..\Lists\Googlebot.txt"

[HTTP headers]
In = FALSE
Out = TRUE
Key = "User-Agent: Googlebot (Out)"
URL = "$LST(Googlebot)"
Match = "*"
Replace = "Googlebot/2.1 (+http://www.googlebot.com/bot.html)"

In = FALSE
Out = TRUE
Key = "Referer: Kill if pretending Googlebot (Out)"
URL = "$LST(Googlebot)"

### Method 3: blocking cookies

Create a new list "CookieBlock.txt"

[Blocklists]
List.CookieBlock= "..\Lists\CookieBlock.txt"

[HTTP headers]
In = FALSE
Out = TRUE
Key = "Cookie: Block Cookies (Out)"
URL = "$LST(CookieBlock)"

In = TRUE
Out = FALSE
Key = "Set-Cookie: Block Cookies (In)"
URL = "$LST(CookieBlock)"

### Method 4: killing scripts

Create a new list "KillScripts.txt"

[Blocklists]
List.KillScripts= "..\Lists\KillScripts.txt"

[HTTP headers]
In = FALSE
Out = TRUE
Key = "URL: Kill Scripts (Out)"
URL = "$LST(KillScripts)"
Replace = "\k"

### Method 5: faking cookies

Naoko 4.5 only - something similar should be possible with older versions too.
Create a new list "CookieFake.txt" in this format:
www.this_site.com/ $SET(fcookie=THIS_NAME=THAT_VALUE)

[Blocklists]
List.CookieFake= "..\Lists\CookieFake.txt"

[HTTP headers]
In = FALSE
Out = TRUE
Key = "Cookie: Fake Cookies (Out)"
URL = "$LST(CookieFake)"
Replace = "$GET(fcookie)"


############ New/updated filters for my config

Naoko 4.5 only. Remove these obsolete filters:

Cookie: 1 Kill a Cookie     4.01.06 (Option I) [srl] (o.2) (Out)
Set-Cookie: 1 Make Cookies Session only     3.12.08 (Option II) [srl] (d.1) (In)
Set-Cookie: 1 Never accept Cookies     3.12.08 (Option I) [srl] (o.2) (In)

Merge these filters:

[HTTP headers]
In = FALSE
Out = FALSE
Key = "!|||||||||||| 6 Set Flag: Block Cookies by Default     4.05.29 [srl] (o.2) (Out)"
URL = "(^$TST(cookie_b=*))$SET(cookie_b=1)"

In = FALSE
Out = TRUE
Key = "!|||||||||||| 6 Set Flag: Session Cookies by Default     4.05.29 [srl] (d.1) (Out)"
URL = "(^$TST(cookie_s=*))$SET(cookie_s=1)"

In = FALSE
Out = TRUE
Key = "Cookie: 1 Block Cookies     4.05.29 (cch!) [srl] (d.r) (Out)"
URL = "^$LST(CookieList)"
Match = "\0&$TST(cookie_b=[12])(^$TST(keyword=*.fakecookie.*))&$LOG(CRESP $DTM(c) : HDR_Out Cookie killed: \0)&($TST(log=2)$ADDLST(LogMain,[$DTM(d T)]\tHDR_Out Cookie\t\0 \t\u)|)"

In = TRUE
Out = FALSE
Key = "Set-Cookie: 1 Make Cookies Session only     4.05.29 (cch!) [srl] (d.r) (In)"
URL = "$TST(cookie_s=[12])(^$LST(CookieList))"
Match = "?&\0&$SET(scookie=\0)&(\#; expires=[^;]+)+\#&($TST(log=2)$ADDLST(LogMain,[$DTM(d T)]\tHDR_In Set-Cook_exp\t\0 \t\u)|)"
Replace = "\@"

In = TRUE
Out = FALSE
Key = "Set-Cookie: 2 Block Cookies     4.05.29 (cch!) [srl] (d.r) (In)"
URL = "$TST(cookie_b=[12])(^$LST(CookieList))"
Match = "?&\0&$SET(scookie=\0)&$LOG(CRESP $DTM(c) : HDR_In Cookie killed: \0)&($TST(log=2)$ADDLST(LogMain,[$DTM(d T)]\tHDR_In Set-Cookie\t\0 \t\u)|)"

############ Updated list entries for the IncludeExclude list

Complete news site list, some entries are already present.
No need to copy/paste this, just copy the included "IncludeExclude.ptxt"
over the old list.

###---- start IncludeExclude.ptxt ----

# User-Agent: fake Googlebot -- Referrer: remove (gbot)
# -----------------------------------------------------------------------------

# Canton Repository: Bypass Registration
(www.|)cantonrep.com/					$SET(keyword=.gbot.)

# Cox news sites: Bypass Registration
www.accessatlanta.com/					$SET(keyword=.gbot.)
www.ajc.com/						$SET(keyword=.gbot.)
www.daytondailynews.com/				$SET(keyword=.gbot.)
www.journal-news.com/					$SET(keyword=.gbot.)
www.springfieldnewssun.com/				$SET(keyword=.gbot.)
www.statesman.com/					$SET(keyword=.gbot.)

# Cox news sites: Bypass SSL redirect
www.dailyadvance.com/					$SET(keyword=.gbot.)
www.fairfield-echo.com/					$SET(keyword=.gbot.)
www.gjsentinel.com/					$SET(keyword=.gbot.)
www.gopbi.com/						$SET(keyword=.gbot.)
www.lapalmainteractivo.com/				$SET(keyword=.gbot.)
www.lufkindailynews.com/				$SET(keyword=.gbot.)
www.marshallnewsmessenger.com/				$SET(keyword=.gbot.)
www.middletownjournal.com/				$SET(keyword=.gbot.)
www.news-journal.com/					$SET(keyword=.gbot.)
www.palmbeachdailynews.com/				$SET(keyword=.gbot.)
www.reflector.com/					$SET(keyword=.gbot.)
www.rockymounttelegram.com/				$SET(keyword=.gbot.)
www.wacotrib.com/					$SET(keyword=.gbot.)
www.western-star.com/					$SET(keyword=.gbot.)

# Medscape: Bypass Registration
www.medscape.com/					$SET(keyword=.gbot.)

# Silicon Strategies: Bypass Registration
(www.|)siliconstrategies.com/				$SET(keyword=.gbot.)

# Sydney Morning Herald: Bypass Registration
(www.|)smh.com.au/					$SET(keyword=.gbot.)

# TechWeb news sites: Different layout - No/Less ad banners
(www.|)banktech.com/					$SET(keyword=.gbot.)
(www.|)commweb.com/					$SET(keyword=.gbot.)
(www.|)financetech.com/					$SET(keyword=.gbot.)
(www.|)governmententerprise.com/			$SET(keyword=.gbot.)
(www.|)informationweek.com/				$SET(keyword=.gbot.)
(www.|)insurancetech.com/				$SET(keyword=.gbot.)
(www.|)internetweek.com/				$SET(keyword=.gbot.)
(www.|)itprodownloads.com/				$SET(keyword=.gbot.)
(www.|)itutilitypipeline.com/				$SET(keyword=.gbot.)
(www.|)linuxpipeline.com/				$SET(keyword=.gbot.)
(www.|)mobilepipeline.com/				$SET(keyword=.gbot.)
(www.|)networkcomputing.com/				$SET(keyword=.gbot.)
(www.|)networkingpipeline.com/				$SET(keyword=.gbot.)
(www.|)networkmagazine.com/				$SET(keyword=.gbot.)
(www.|)nwc.com/						$SET(keyword=.gbot.)
(www.|)securitypipeline.com/				$SET(keyword=.gbot.)
(www.|)serverpipeline.com/				$SET(keyword=.gbot.)
(www.|)smallbizpipeline.com/				$SET(keyword=.gbot.)
(www.|)storagepipeline.com/				$SET(keyword=.gbot.)
(www.|)techweb.com/					$SET(keyword=.gbot.)
(www.|)transformmag.com/				$SET(keyword=.gbot.)
(www.|)wallstreetandtech.com/				$SET(keyword=.gbot.)

# Telecom.paper: Bypass Registration
www.telecom.paper.nl/					$SET(keyword=.gbot.)

# Washington Post: Bypass Registration
media.washingtonpost.com/		$SET(keyword=.gbot.)
www.washingtonpost.com/			$SET(keyword=.gbot.)

# block cookies			$SET(cookie_b=2)
# -----------------------------------------------------------------------------

# Real Cities news sites: Bypass Registration
www.adn.com/		$SET(cookie_b=2)
www.fresnobee.com/	$SET(cookie_b=2)
www.newsobserver.com/	$SET(cookie_b=2)
www.recordonline.com/	$SET(cookie_b=2)

# block "arrival" redirect
([^.]+.|)ivillage.com/		$SET(cookie_b=2)
# block registration dialog
([^.]+.|)jsonline.com/		$SET(cookie_b=2)

# fake cookies (fakecookie) $SET(fcookie=*) - optionally $SET(append=1)
# -----------------------------------------------------------------------------

# Advance.net news sites: Avoid Zip code box
(www.|)al.com/		$SET(keyword=.fakecookie.)	$SET(fcookie=GTC=:6:000000:::)
(www.|)cleveland.com/	$SET(keyword=.fakecookie.)	$SET(fcookie=GTC=:6:000000:::)
(www.|)masslive.com/	$SET(keyword=.fakecookie.)	$SET(fcookie=GTC=:6:000000:::)
(www.|)mlive.com/	$SET(keyword=.fakecookie.)	$SET(fcookie=GTC=:6:000000:::)
(www.|)nj.com/		$SET(keyword=.fakecookie.)	$SET(fcookie=GTC=:6:000000:::)
(www.|)nola.com/	$SET(keyword=.fakecookie.)	$SET(fcookie=GTC=:6:000000:::)
(www.|)oregonlive.com/	$SET(keyword=.fakecookie.)	$SET(fcookie=GTC=:6:000000:::)
(www.|)pennlive.com/	$SET(keyword=.fakecookie.)	$SET(fcookie=GTC=:6:000000:::)
(www.|)silive.com/	$SET(keyword=.fakecookie.)	$SET(fcookie=GTC=:6:000000:::)
(www.|)syracuse.com/	$SET(keyword=.fakecookie.)	$SET(fcookie=GTC=:6:000000:::)

# IGN/GameSpy: Bypass Ad intercept, as long as they keep that bit there
# - alternate method: $SET(keyword=.gbot.)
[^.]+.gamespy.com/		$SET(keyword=.fakecookie.)
  $SET(fcookie=Raisin=00.0000000000.0000.0000000010000000000000000000000000.0.0)
[^.]+.ign.com/			$SET(keyword=.fakecookie.)
  $SET(fcookie=Raisin=00.0000000000.0000.0000000010000000000000000000000000.0.0)

# Irish Examiner: Registered user
www.examiner.ie/		$SET(keyword=.fakecookie.)
  $SET(fcookie=IISPROTECTLOGIN=SID=ouYppgasPHtrF8qX&PW=XU83R%2BFLwg%3D%3D&USER=crap%40mailinator%2Ecom)
www.irishexaminer.com/		$SET(keyword=.fakecookie.)
  $SET(fcookie=IISPROTECTLOGIN=SID=ouYppgasPHtrF8qX&PW=XU83R%2BFLwg%3D%3D&USER=crap%40mailinator%2Ecom)

# KnoxNews: Registered user
(www.|)knoxnews.com/		$SET(keyword=.fakecookie.)
  $SET(fcookie=role=0102zz!!10862052470000%0A%00%0Escreated%5FdtZZ1086205247ZZ%05%00%17srolesZZZZ8ZZZZKNSGENERALZZZZ%07%00%0Asuser%5FidZZ283814ZZ%00)

# North Jersey: Registered user
(www.|)northjersey.com/		$SET(keyword=.fakecookie.)
  $SET(fcookie=reg=qE%257Baw%252Bb%2560%257Eipb%2526%257FuxW%257Cih.0%2526%2529%2521%2524%253B1%253B.3%2528%257Fwn%257CY%2560svd%2522%252B)

# Salon.com: Become Premium Member "ProxUser" & don't show ads
([^.]+.|)salon.com/		$SET(keyword=.fakecookie.)
  $SET(fcookie=SALON_PREMIUM=SALN_REG%3DY%2CSALN_USERNAME%3DProxUser%2CSALN_SHOW_ADS%3DN)

# The Onion: Bypass Ad intercept
(www.|)theonion.com/		$SET(keyword=.fakecookie.)
  $SET(fcookie=intercookie=; precookie=)

# Times Online: Registered user
www.timesonline.co.uk/		$SET(keyword=.fakecookie.)
  $SET(fcookie=cc=GB; username=dontbugme; gaAuth=ZG9udGJ1Z21lAAAAAABAwg1ZAAAAAEDDXtk5YTFlNGQzYTlmNjI0NzA2ODA2MzIxMGM3MzdlMTE3YwAIAAAAAAAgADY5ZDY5YmFiNWJkYTQwYTQ2YzM2MjVjMjBmZmU4M2Q3N2IxMjZjNzBlYjYxOTA3Mjc2NGI0NDE4ZDc1OGE2N2M%3Dnipd)

# Tribune news sites: Registered user
www.chicagotribune.com/		$SET(keyword=.fakecookie.)
  $SET(fcookie=ti_core=i:6858185|v:4|al:1#wq8DNVl2sY7YwBg042Pe3+iAap/bquG/algdeg8r4rhNmaemuBg8DnVv59IAAgZWKgEjz4FZIfymYZUvuS+wrko56uHIC1cCAkjanJLn+tazuvGL0+qvwQ==)
www.ctnow.com/			$SET(keyword=.fakecookie.)
  $SET(fcookie=ti_core=i:6858185|v:4|al:1#wq8DNVl2sY7YwBg042Pe3+iAap/bquG/algdeg8r4rhNmaemuBg8DnVv59IAAgZWKgEjz4FZIfymYZUvuS+wrko56uHIC1cCAkjanJLn+tazuvGL0+qvwQ==)
www.latimes.com/		$SET(keyword=.fakecookie.)
  $SET(fcookie=ti_core=i:6858185|v:4|al:1#wq8DNVl2sY7YwBg042Pe3+iAap/bquG/algdeg8r4rhNmaemuBg8DnVv59IAAgZWKgEjz4FZIfymYZUvuS+wrko56uHIC1cCAkjanJLn+tazuvGL0+qvwQ==)
www.orlandosentinel.com/	$SET(keyword=.fakecookie.)
  $SET(fcookie=ti_core=i:6858185|v:4|al:1#wq8DNVl2sY7YwBg042Pe3+iAap/bquG/algdeg8r4rhNmaemuBg8DnVv59IAAgZWKgEjz4FZIfymYZUvuS+wrko56uHIC1cCAkjanJLn+tazuvGL0+qvwQ==)

# fake referrer (fakereferrer) $SET(freferrer=*)
# -----------------------------------------------------------------------------
([^.]+.|)fool.com/		$SET(keyword=.fakereferrer.)	$SET(freferrer=http://news.google.com/)
([^.]+.|)telegraph.co.uk/	$SET(keyword=.fakereferrer.)	$SET(freferrer=http://news.google.com/)
(www.|)handelsblatt.com/	$SET(keyword=.fakereferrer.)	$SET(freferrer=http://news.google.com/)
(www.|)smh.com.au/		$SET(keyword=.fakereferrer.)	$SET(freferrer=http://news.google.com/)
www.news24.com/			$SET(keyword=.fakereferrer.)	$SET(freferrer=http://news.google.com/)
www.timesonline.co.uk/		$SET(keyword=.fakereferrer.)	$SET(freferrer=http://news.google.com/)

# Only allow required cookies		(allowcookie) $SET(acookie=*)
# -----------------------------------------------------------------------------

# Jerusalem Post: Bypass Registration -- Alternate method: $SET(keyword=.gbot.)
www.jpost.com/		$SET(keyword=.allowcookie.) $SET(acookie=.cookies.)

# New York Times: Bypass Registration
# -----------------------------------------------------------------------------
([^.]+.|)nytimes.com/auth/login		$SET(keyword=.fakecookie.)
  $SET(fcookie=NYT-S=1gxTGyeBtVQ3bztZQ6.II7M5muiyHTbkcsLltr9RtkrV4zQwUduKv2ZkvLOOdsyetwr.kKeI4T8ezECZTGPGQ/4O9JSHysPoXEl/B6iWk8lHU0)

# Real Cities news sites: Bypass Registration -- Alternate methods:
# $SET(keyword=.gbot.) or add "ONECLICK-LOCKOUT" to "CookieValues.ptxt"
www.charlotte.com/		$SET(keyword=.allowcookie.) $SET(acookie=.ONECLICK-TESTCOOKIE.)
www.contracostatimes.com/	$SET(keyword=.allowcookie.) $SET(acookie=.ONECLICK-TESTCOOKIE.)
www.dfw.com/			$SET(keyword=.allowcookie.) $SET(acookie=.ONECLICK-TESTCOOKIE.)
www.kansascity.com/		$SET(keyword=.allowcookie.) $SET(acookie=.ONECLICK-TESTCOOKIE.)
www.mercurynews.com/		$SET(keyword=.allowcookie.) $SET(acookie=.ONECLICK-TESTCOOKIE.)
www.miami.com/			$SET(keyword=.allowcookie.) $SET(acookie=.ONECLICK-TESTCOOKIE.)
www.ohio.com/			$SET(keyword=.allowcookie.) $SET(acookie=.ONECLICK-TESTCOOKIE.)
www.philly.com/			$SET(keyword=.allowcookie.) $SET(acookie=.ONECLICK-TESTCOOKIE.)
www.twincities.com/		$SET(keyword=.allowcookie.) $SET(acookie=.ONECLICK-TESTCOOKIE.)

# remove referrer (killref)
# -----------------------------------------------------------------------------
techrepublic.com.com/				$SET(keyword=.killref.)
builder.com.com/				$SET(keyword=.killref.)

###---- end IncludeExclude.ptxt ----

############ New list entries for the AliasJump list

Copy/paste everything between "###---- start" and "###---- end"

###---- start AliasJump.ptxt ----

# Belo Interactive news sites: Bypass registration
www.azfamily.com/redir.js	$RDIR(http://local.ptron/sidki/empty)
www.dallasnews.com/redir.js	$RDIR(http://local.ptron/sidki/empty)
www.dentonrc.com/redir.js	$RDIR(http://local.ptron/sidki/empty)
www.fox11az.com/redir.js	$RDIR(http://local.ptron/sidki/empty)
www.guidelive.com/redir.js	$RDIR(http://local.ptron/sidki/empty)
www.kgw.com/redir.js		$RDIR(http://local.ptron/sidki/empty)
www.khou.com/redir.js		$RDIR(http://local.ptron/sidki/empty)
www.king5.com/redir.js		$RDIR(http://local.ptron/sidki/empty)
www.kmov.com/redir.js		$RDIR(http://local.ptron/sidki/empty)
www.krem.com/redir.js		$RDIR(http://local.ptron/sidki/empty)
www.ktvb.com/redir.js		$RDIR(http://local.ptron/sidki/empty)
www.kvue.com/redir.js		$RDIR(http://local.ptron/sidki/empty)
www.mysanantonio.com/redir.js	$RDIR(http://local.ptron/sidki/empty)
www.nwcn.com/redir.js		$RDIR(http://local.ptron/sidki/empty)
www.pe.com/redir.js		$RDIR(http://local.ptron/sidki/empty)
www.projo.com/redir.js		$RDIR(http://local.ptron/sidki/empty)
www.txcn.com/redir.js		$RDIR(http://local.ptron/sidki/empty)
www.wcnc.com/redir.js		$RDIR(http://local.ptron/sidki/empty)
www.wfaa.com/redir.js		$RDIR(http://local.ptron/sidki/empty)
www.whas11.com/redir.js		$RDIR(http://local.ptron/sidki/empty)
www.wvec.com/redir.js		$RDIR(http://local.ptron/sidki/empty)
www.wwltv.com/redir.js		$RDIR(http://local.ptron/sidki/empty)

###---- end AliasJump.ptxt ----

############ EOF

Take your pick :)
sidki
