Welcome Guest ( Log In | Register )

 Forum Rules Joomla Scraper support
 
Reply to this topicStart new topic
> Issues With Greek Characters
mkrokos
post Sep 11 2013, 09:57 AM
Post #1


Newbie
*

Group: Members
Posts: 12
Joined: 9-September 13
Member No.: 1,821



I have succesfully created my first feed with Scraper and ran the cron which worked like a charm. Except a few things:

  • The RSS feed images are too small for my K2 items image. Is there a way to pull the normal article image from the feed source?
  • Some titles (not all) are coming in with their -what I think- html entity names (we are talking about Greek here). So, while some of the imported articles' titles are read OK, others are read like this:
    Deltaiotaalphatheta941tauomega upsilonpiomicronmuomicronnu942 kappaalphaiota 972rhoepsilonxieta gammaiota alphaupsilontau972 kappaalphaiota thetaalpha epsilonpiiotamuepsilon943nuomega
  • Another issue is with the word used for the "read more" link as set at the feed setup. I try to enter the word "Περισσότερα: " but when I save the feed it returns with questionmarks which means that there is an issue with the DB collation. Is it not using UTF-8 for its queries?


there are also issues with the automatically imported tags from the titles, it would be good to be able to create a list of tags not to be inserted. There might be a solution for that already, I must go through the documentation.
Go to the top of the page
 
+Quote Post
Web Design Seo
post Sep 11 2013, 11:44 AM
Post #2


Web Design Seo
****

Group: Root Admin
Posts: 4,156
Joined: 29-April 09
From: Sofia
Member No.: 1



Version of joomla?


--------------------
Правила на форума | Forum Rules | How to receive support. 3D Web Design: Уеб дизайн, Seo оптимизация, Web Site Extensions, Oscommerce Addons, Wordpress plugins and Joomla Extensions. Изработка на уеб сайтове и оптимизация на сайт за търсачки и Seo услуги.
Go to the top of the page
 
+Quote Post
mkrokos
post Sep 11 2013, 11:48 AM
Post #3


Newbie
*

Group: Members
Posts: 12
Joined: 9-September 13
Member No.: 1,821



QUOTE (Web Design Seo @ Sep 11 2013, 02:44 PM) *
Version of joomla?


Joomla 2.5.14
PHP 5.3.27
Go to the top of the page
 
+Quote Post
pavelKukov
post Sep 11 2013, 12:09 PM
Post #4


Php programmer
****

Group: Administrators
Posts: 285
Joined: 26-November 12
From: Bulgaria
Member No.: 1,452



Цитат(mkrokos @ Sep 11 2013, 12:57 PM) *
I have succesfully created my first feed with Scraper and ran the cron which worked like a charm. Except a few things:

  • The RSS feed images are too small for my K2 items image. Is there a way to pull the normal article image from the feed source?
  • Some titles (not all) are coming in with their -what I think- html entity names (we are talking about Greek here). So, while some of the imported articles' titles are read OK, others are read like this:
    Deltaiotaalphatheta941tauomega upsilonpiomicronmuomicronnu942 kappaalphaiota 972rhoepsilonxieta gammaiota alphaupsilontau972 kappaalphaiota thetaalpha epsilonpiiotamuepsilon943nuomega
  • Another issue is with the word used for the "read more" link as set at the feed setup. I try to enter the word "Περισσότερα: " but when I save the feed it returns with questionmarks which means that there is an issue with the DB collation. Is it not using UTF-8 for its queries?


there are also issues with the automatically imported tags from the titles, it would be good to be able to create a list of tags not to be inserted. There might be a solution for that already, I must go through the documentation.


[*]The RSS feed images are too small ...
This is the functionality by default. Extracting bigger copies of those images is very specific to site and rss. You need to switch on Scraper function and to configure it for each site and rss.

[*]Some titles (not all) ...
This looks like usage of special html symbols between words(or in them). Please send as a link to those feeds. You have function to clear these symbols, just switch function on.

[*]DB collation
Tables do not have default collation. It seems that your server collation is different from UFT-8. You can change collation manualy - change it to "utf-8". Table name is "#__aggregator". Replace "#__" with your joomla installation specific prefix.


--------------------
Php programmer in 3D Web Design
Go to the top of the page
 
+Quote Post
ataman79
post May 16 2014, 08:57 AM
Post #5


Newbie
*

Group: Members
Posts: 32
Joined: 26-November 10
Member No.: 399



HI All,

I have similar problem as the Title of this post, but with Cyrillic characters and quotation marks " ".

So I noticed, that when I'm importing RSS feeds, in which the titles contain quotation marks " " , I am getting strange characters in K2 items title , like this:


I set NO to NO Strip special chars in title , but even that I got the same titles.

If I set YES to Strip special chars in title, I am getting title like:
1041108010741096108011031090 11021088108010891082108610851089109110831090 10851072 1062105710501040 106310771088107410771085108010901077 10971077 107410791077108410721090 108310801094107710851079 1072109010721082107210901072 1080107610741072 10861090 1077


Please help me to fix this problem

Joomla 3.3
Scraper - 1.9.4

This post has been edited by Web Design Seo: Jul 15 2015, 02:08 PM
Go to the top of the page
 
+Quote Post
ataman79
post May 19 2014, 07:39 AM
Post #6


Newbie
*

Group: Members
Posts: 32
Joined: 26-November 10
Member No.: 399



Can you help me solving this problem ?
Go to the top of the page
 
+Quote Post
Web Design Seo
post May 19 2014, 02:20 PM
Post #7


Web Design Seo
****

Group: Root Admin
Posts: 4,156
Joined: 29-April 09
From: Sofia
Member No.: 1



New version for Joomla 3 is in progress and will be ready tomorrow. When v.1.9.5 is ready will be checked your issue with special chars. Please, post here or send me PM with link to feed to test.


--------------------
Правила на форума | Forum Rules | How to receive support. 3D Web Design: Уеб дизайн, Seo оптимизация, Web Site Extensions, Oscommerce Addons, Wordpress plugins and Joomla Extensions. Изработка на уеб сайтове и оптимизация на сайт за търсачки и Seo услуги.
Go to the top of the page
 
+Quote Post
ataman79
post May 19 2014, 02:46 PM
Post #8


Newbie
*

Group: Members
Posts: 32
Joined: 26-November 10
Member No.: 399



Цитат(Web Design Seo @ May 19 2014, 02:20 PM) *
New version for Joomla 3 is in progress and will be ready tomorrow. When v.1.9.5 is ready will be checked your issue with special chars. Please, post here or send me PM with link to feed to test.



Hello and thank you for the answer

Here is the rss link:
Код
http://www.sportal.bg/uploads/rss_category_0.xml


Just be sure to import a news with quotation marks " " in the title.
Go to the top of the page
 
+Quote Post
pavelKukov
post May 20 2014, 12:05 PM
Post #9


Php programmer
****

Group: Administrators
Posts: 285
Joined: 26-November 12
From: Bulgaria
Member No.: 1,452



Hello, @ataman79!

Is your server database encoding UTF-8?

I was not able to reproduce your problem with sportal.bg feeds. Please send me login details and FTP access to pavel at 3dwebdesign.org. I will try to find out what is wrong with your installation.


--------------------
Php programmer in 3D Web Design
Go to the top of the page
 
+Quote Post
ataman79
post May 21 2014, 08:36 AM
Post #10


Newbie
*

Group: Members
Posts: 32
Joined: 26-November 10
Member No.: 399



QUOTE (pavelKukov @ May 20 2014, 12:05 PM) *
Hello, @ataman79!

Is your server database encoding UTF-8?

I was not able to reproduce your problem with sportal.bg feeds. Please send me login details and FTP access to pavel at 3dwebdesign.org. I will try to find out what is wrong with your installation.



Hi I sent you the needed data by e-mail, as you requested

Greetings
Go to the top of the page
 
+Quote Post
pavelKukov
post May 21 2014, 10:25 AM
Post #11


Php programmer
****

Group: Administrators
Posts: 285
Joined: 26-November 12
From: Bulgaria
Member No.: 1,452



Hello everybody!
New version of aggregator scrapper for joomla 3. Version number is 1.9.5.1

1. Fixed problem with cyrillic titles containing quotation marks.
2. Fixed problem with cyrillic url alias transliteration in K2.


--------------------
Php programmer in 3D Web Design
Go to the top of the page
 
+Quote Post
ataman79
post May 21 2014, 02:52 PM
Post #12


Newbie
*

Group: Members
Posts: 32
Joined: 26-November 10
Member No.: 399



QUOTE (pavelKukov @ May 21 2014, 10:25 AM) *
Hello everybody!
New version of aggregator scrapper for joomla 3. Version number is 1.9.5.1

1. Fixed problem with cyrillic titles containing quotation marks.
2. Fixed problem with cyrillic url alias transliteration in K2.



GREAT !
Go to the top of the page
 
+Quote Post

Reply to this topicStart new topic
1 User(s) are reading this topic (1 Guests and 0 Anonymous Users)
0 Members:

Collapse

> Similar Topics

  Topic Replies Topic Starter Views Last Action
No New Posts Pinned: Without Support To 4 January 2016
Happy new 2016 Year
0 Web Design Seo 15,433 31st December 2015 - 01:01 PM
Last post by: Web Design Seo
No New Posts Without Support To 6 January 2014
0 Web Design Seo 10,961 20th December 2013 - 03:14 PM
Last post by: Web Design Seo
No New Posts Without Support To 14 August 2013
0 Web Design Seo 12,298 1st August 2013 - 06:25 AM
Last post by: Web Design Seo
No New Posts Without Support To 7 May 2013
Bulgarian national holidays
1 Web Design Seo 10,286 30th April 2013 - 09:46 AM
Last post by: Web Design Seo
No New Posts Without Support To 3 January 2013
0 Web Design Seo 5,604 21st December 2012 - 12:07 PM
Last post by: Web Design Seo
No New Posts Without Support To 28 May 2012
0 Web Design Seo 6,785 24th May 2012 - 11:27 AM
Last post by: Web Design Seo
No New Posts Without Support To 3 January 2012
0 Web Design Seo 6,358 30th December 2011 - 01:49 PM
Last post by: Web Design Seo


 



RSS Lo-Fi Version Time is now: 18th August 2019 - 04:39 AM
Clicky Web Analytics