• BrikoX@lemmy.zip
    link
    fedilink
    English
    arrow-up
    67
    arrow-down
    7
    ·
    edit-2
    1 year ago

    Brave Search fully using their own index since April 27, 2023. But they refuse to identify their crawler and rely on googlebot if sites want to be excluded. Also their search API monetization of possible copyrighted content while understandable is a bit doubious due to their public stance on transparency.

    StartPage also blocks VPN usage.

    DuckDuckGo by their own admission now re-rank “trusted” sites to the top when it comes to what they clasify as"misinformation" so calling their “censorship” mild is huge understatement.

    • Franzia@lemmy.blahaj.zone
      link
      fedilink
      arrow-up
      16
      arrow-down
      4
      ·
      1 year ago

      If I wanted to search for unverified info or misinfo, I could, but almost always I am lookkng for factual and sourced information. Please don’t force me to do otherwise.

        • DigitalJacobin@lemmy.ml
          link
          fedilink
          English
          arrow-up
          2
          ·
          1 year ago

          That’s how all search engines fundamentally work though. The whole point if that they try to bring the most relevant results to the top and downrank things like spam and unhelpful/irrelevant results. Downranking misinfo spam websites isn’t “censorship”. Not ranking resullts would make search engines completely pointless.

          • RaivoKulli@sopuli.xyz
            link
            fedilink
            arrow-up
            1
            ·
            1 year ago

            I’d disagree with equating disinfo with spam. Spam seems easier to classify, sites that try to get ahead by having nonsense keywords or whatever and want to sell you something. Dis- or misinfo is trickier, you need to decide what is correct info. Do you understand what I mean?

    • Atemu@lemmy.ml
      link
      fedilink
      arrow-up
      8
      arrow-down
      1
      ·
      1 year ago

      StartPage also blocks VPN usage.

      Only accidental I think. They have the option of reporting that you’re behind a VPN proxy when it happens.

    • If they pretend to be Googlebot they should be pretty easy to block, Google will let you check the source IP of Googlebot to prevent others from pretending to be it (which on some sites bypasses certain pages; plenty of PHPBB forum out there that will require an account to view threads, except when your user agent is Googlebot). It’s just a matter of doing a DNS lookup, which can be cached and shouldn’t take very long, even for larger sites. A similar method works for Bingbot as well.

      Doing this verification will also kick out tons of other crawlers and bots that you probably don’t want anyway.

      I don’t really see what optimisation of search engines has to do with censorship. Search engine users want answers, they’re not just an SEO API. Without some manual balancing, search engines would be as useless as the second or third page of Google.

      • BrikoX@lemmy.zip
        link
        fedilink
        English
        arrow-up
        4
        arrow-down
        1
        ·
        1 year ago

        They don’t pretent to be googlebot, they use their own crawler they just don’t share the name they use for it, so sites can’t exclude it with robots.txt. They just scrape the same sites that googlebot does, so if the site is excluded by googlebot they also skip it.

    • TrustingZebra@lemmy.one
      link
      fedilink
      arrow-up
      3
      arrow-down
      1
      ·
      edit-2
      1 year ago

      StartPage also blocks VPN usage.

      Ancedotal but Startpage works perfectly fine with VPN for me. Certainly better than Google, which works but requires a lot of annoying captchas.

      • LinkOpensChest.wav@lemmy.one
        link
        fedilink
        arrow-up
        2
        ·
        1 year ago

        I used Startpage for a long time, and I’m perpetually connected to VPN on both my PC and my phone (different nodes at different times)

        Never had a problem with my VPN